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PREFACE 


This monograph is intended as an exposition of some 
central results on irrational numbers, and is not aimed at 
providing an exhaustive treatment of the problems with 
which it deals. The term “irrational numbers,” a usage 
inherited from ancient Greece which is not too felicitous 
in view of the everyday meaning of the word “irrational,” 
is employed in the title in a generic sense to include such 
related categories as transcendental and normal num- 
bers. 

The entire subject of irrational numbers cannot of 
course be encompassed in a single volume. In the selec- 
tion of material the main emphasis has been on those as- 
pects of irrational numbers commonly associated with 
number theory and Diophantine approximations. The top- 
ological facets of the subject are not included, although 
the introductory part of Chapter I has a sketch of some 
of the simplest set-theoretic properties of the irrationals as 
a part of the continuum. The axiomatic basis for irra- 
tional numbers, proceeding say from the Peano postulates 
for the natural numbers to the construction of the real 
numbers, is purposely omitted, because in the first place 
the aim is not in the direction of the foundations of math- 
ematics, and in the second place there are excellent treat- 
ments of this topic readily available. 

vu 
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The customary organization of a book with related sub- 
jects grouped together has been modified in part by con- 
sideration of the degree of difficulty of the topics, proceed- 
ing from the easiest to the most difficult. For example, al- 
most all the theorems on irrational numbers in Chapter II 
are implied by the stronger results of Chapter IX, but, 
whereas Chapter II requires only calculus and the barest 
rudiments of number theory for understanding, Chapter 
IX presupposes some basic results on algebraic numbers 
and complex functions. The first seven chapters are dis- 
tinctly easier reading than the last three, with fewer pre- 
requisite results needed and less mathematical maturity 
required of the reader. The chapters are for the most part 
independent of one another and so can be read separately; 
the major exception to this statement is the use in Chapter 
VI of some results from Chapter V. 

The only knowledge required of the reader beyond quite 
elementary mathematics is some algebraic number theory 
in Chapters III, [IX and X, and some function theory in 
Chapters VI, VIII, [IX and X. Most of the results needed 
are well-known theorems, central to the mainstream of 
mathematics, and complete references are given to stand- 
ard works. In those few instances where the prerequisite 
material is at all special, it has been included in the 
text. 

The books by Hardy and Wright, Koksma, Perron, and 
Siegel listed on page 157 have been very helpful, and I have 
made free use of these excellent sources. Further source 
material is listed in the notes at the ends of the chapters. 
These references, along with the remarks in the Notes, 
may be taken or left alone at the reader’s choice. Some 
further results beyond the scope of this book are also listed 
in the Notes; however, as any expert in the subject will 
readily see, I have not attempted to be either systematic 
or complete about this. For the convenience of the reader 
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there is appended a list of notation and a glossary on pages 
151 to 156. 


* * * 


Substantial improvements in the book have resulted from 
discussion of many points with my colleagues at the Uni- 
versity of Oregon, and from bibliographic suggestions by 
Professor C. D. Olds. I am also indebted to the Edito- 
rial Committee of the Carus Monographs for help in remov- 
ing several errors and obscurities. But especially I wish to 
acknowledge my indebtedness to Professor H. 8. Zucker- 
man who has been actively interested in this project from 
the start. Discussions with him during the early stages in- 
fluenced markedly the final versions of Chapters I and V. 
In addition he has read the manuscript very thoroughly 
and critically. However I did not invariably follow the 
suggestions of these friendly critics; so the responsibility for 
the shortcomings of the monograph is entirely mine. 


Ivan NIVEN 
University of Oregon 
July 1956 
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CHAPTER ] 


RATIONALS AND IRRATIONALS 


1. The preponderance of irrationals. Our general 
intention in this book is to characterize, classify, and ex- 
hibit irrational numbers in various ways, not only in the 
framework of real numbers but also in the larger setting 
of complex numbers. We do not examine the axiomatic 
foundations of our subject, preferring simply to take for 
granted the following basic classification. A rational num- 
ber is one that can be put in the form h/k, where h and k 
are integers with k ~ 0. Real numbers like +~/2 which 
are not rational are said to be irrational. t 

The first three sections of this chapter are devoted to 
some observations on the rational and irrational numbers 
regarded as point sets on the real line. This set-theoretic 
analysis is rather cursory, and is not typical of the general 
line of thought of this monograph, which is more number- 
theoretic in character. 

We begin by drawing attention to the overwhelming 
preponderance of irrationals over rationals. This cannot 
be established by a simple count, since both the rational 
numbers and the irrational numbers constitute infinite 
sets. To make a comparison of these two sets, we think 

t In addition to definitions and explanation of terminology given 
in the text, there is a Glossary and List of Notation at the end of the 
book. 
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of the real numbers as points on a line in the customary 
fashion in coordinate geometry. Any interval on this line, 
for example the interval from 3 to 5, or all x satisfying 
3 <2 Ss 5, covers all rational and irrational points in the 
interval. What we will establish is that we can create a 
set of intervals which cover all the rational points on the 
real line, and yet the total of the lengths of the intervals 
is arbitrarily small. The irrational numbers do not pos- 
sess this property. 

A set S of real numbers is said to have measure zero if it 
is possible to cover the points of S with a set of intervals 
of arbitrarily small total length. To give a simple exam- 
ple, the set of positive integers has measure zero as can 
be seen as follows. Enclose the integer 1 in the interval 
(1 — ¢/2, 1+6/2), the integer 2 in the interval (2 — ¢/4, 
2-+ ¢/4), -+-, in general the integer n in the interval 
(n — €/2", n+ ¢/2"). Hence the positive integers are 
covered by intervals of total length e+ «/2 + «/4+--: 
= 2e, which can be made arbitrarily small. 

Next, we say that almost all real numbers in an interval 
have a certain property P if the set of real numbers (or 
points) lacking the property P has measure zero. For ex- 
ample by the previous paragraph we may say that almost 
all positive real numbers are non-integers. This is a spe- 
cial case of the following result. 


THEOREM 1.1. Almost all real numbers are irrational. 

Proof. First we prove that the positive rational num- 
bers in the unit interval (0, 1) have measure zero. These 
numbers are the rationals h/k with 1 S$ h S k, and we en- 
close each such rational in an interval 


(1.1) (h/k — /k3, h/k + ¢/k?). 


We could avoid unnecessary duplication of the rationals 
by imposing the restriction that A and k be relatively 
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prime, but the present proof is correct with or without 
this restriction. These intervals (1.1) have total length 
which is 


yy eee a 


k= h=l kat ? k=2 a ar) 
ia 1 1 
= 26+ 2¢( ~~ 2) = 46 


which can be made arbitrarily small by suitable choice of e. 
(This argument can be shortened by using 2k~? = 17/6.) 

Next we extend this covering procedure to all positive 
rationals by observing that the reciprocal of any h/k < 1 
is k/h = 1 which can be enclosed in an interval 

(k/h — ¢/k8, k/h + €/k*). 

The estimate of the total lengths of these intervals is iden- 
tical with that above, and so can be made arbitrarily 
small. The proof of the theorem is completed by the ob- 
vious extension to the negative rational numbers and zero. 

It is interesting to note a specific irrational number 
not covered by the intervals (1.1) for suitable «. Such a 
number is +/2/2, as can be seen in the following way. 
First, the irrationality of ~/2 implies that, for any inte- 
gers h and k, |k? — 2h?| 21. Hence for any positive 
rational h/k S 1 we have 


Vf? 4 k? — Qh? = 1 
2 kl JE I\| /2 
Qh? (> -) Qk? (S 1) 
2 a 2 = 
1 1 
—_—_2—: 
4k? ~ 4k 


Thus +/2/2, thought of as a point on the real axis, has 
distance from the point h/k greater than 1/4k°, and so 
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4/2/2 is not in any interval (1.1) provided we choose ¢€ 
suitably small, say e = 1/4. 


2. Countability. A set is said to be countable (or de- 
numerable) if it can be put in one-to-one correspondence 
with the positive integers 1, 2, 3, ---. 


THEOREM 1.2. The rational numbers are countable. 

Proof. First we give a rule for ordering the positive 
rational numbers h/k, assuming that (h, k) = 1 to avoid 
duplicates. Say that A;/k, precedes ho/k2 in case either 
of the following conditions (a) or (b) holds: 


(a) tk, < he + ke; 
(b) hy + ky = hp + ke and hy < ho. 


This principle gives us an ordering of the positive ra~ 
tionals beginning thus: 


Pizisie234a1 sb 


121314382161 


This sequence, which includes every positive rational, can 
be put into one-to-one correspondence with the positive 
integers. The extension to all rationals can be obtained 
by inserting the negatives and zero in some such fashion 


as 
1 11 i 2. 2 1 1 


O,=) --> =) — a = SSS ee se 8 


1 1 2 21 #13 3 

THEOREM 1.3. Any countable set S of real numbers, say 
Q, Ag, Gg, --*, has measure zero. 

Proof. The argument used on the positive integers in 
§ 1 can be employed. That is, enclose a, in the interval 
(dn — €/2", an + €/2") for n = 1, 2, 3, --+-. Thus the 
whole set S is enclosed in intervals of length <2e, and 
this is arbitrarily small. 
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It may be noted that Theorem 1.1 is implied by Theo- 
rems 1.2 and 1.3. 


TuroreM 1.4. The irrational numbers are not countable. 
A fortiori the real numbers are not countable. 

Proof. Suppose the irrational numbers were countable, 
say Y1, Y2, Y3, °'*. But the rationals are countable by 
Theorem 1.2, say 71, T2, 73, '::. Thus we could inter- 
sperse these two sequences to get a single sequence rj, ¥1, 
Te, Y2, 73, Y3, °°°. Thus the reals would be countable and 
so would have measure zero by Theorem 1.3. But this 
would say that the whole real] line can be covered by a 
set of intervals of arbitrarily small total length, a contra~- 
diction from which the theorem follows. (The proposi- 
tion that the whole real line cannot be covered by a set of 
intervals of arbitrarily small total length is in a sense in- 
tuitively clear; a rigorous proof, which we do not wish to 
go into here, involves a more detailed study of the to- 
pology of the real line.) 


3. Dense sets. A set S of real numbers is said to be 
everywhere dense in an interval if, given any two real num- 
bers a and @ in the interval, say with a < 8, there is a 
number s in S such that a < s < £. 


TaerorEM 1.5. The rationals are everywhere dense; the 
trrationals are everywhere dense. 

Proof. Let a and 8 be any two real numbers, with 
a <8. By the Archimedean property (cf. glossary) there 
is a positive integer n such that n(@ — a) > l or B-—a 
> 1/n. Choose the integer m to satisfy m < 8B S$ m+ 1. 
Thus we have 

1 om+i1 1 


a<B--s 
n n n 


m 


m 
and — <&8, 
n 


so that m/n is a rational number between a and £. 
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To obtain an irrational number between a and £8, use 
the Archimedean property again to select a positive inte- 
ger k so that k(8 — m/n) > +/2. Hence we can write 


m 2 
ge ge, 
n k n 


so that m/n + +/2/k is an irrational number lying be- 
tween a and £. 


4, Decimal expansions. We shall employ constantly 
the square-bracket notation, [a], to denote the greatest 
integer not exceeding the real number a. In other words, 
[a] is the unique integer m satisfying the inequality m S$ 
a<m-+41. In some circumstances it is convenient to 
have the least integer not less than a: that is, the unique 
integer n satisfying n—1<asn. This can be ob- 
tained with the square-bracket notation; thusn = —[— a]. 
By the fractional part of any real number a is meant the 
difference a — [a], which will be denoted by (a). 

It is well known that the fractional part of any real 
number a can be represented as an infinite decimal ex- 
pansion to base 10; thus 


(a) = a — [a] = .a, 0203 «- 


This decimal expansion is unique except that a so-called 
terminating decimal (one with an infinite succession of 
zeros) can be expressed with an infinite succession of 
nines, as for example 


0.42 = 0.42000--- = 0.41999---. 


The representation of a real number by an infinite 
decimal has been generalized by Cantor in the following 
way. 
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THEOREM 1.6. Let aj, dg, a3, --+ be a sequence of posi- 
tive integers, all greater than 1. Then any real number a is 
uniquely expressible in the form 


am Cz 
(1.2) a =) + > ——— 
t=] 210Q °° * Ay 
with integers c; satisfying the inequalities 0 Sc; S a; — 1 
for allt 2 1, and c; < a; — 1 for infinitely many 7. 
Proof. It will be convenient to have the identity 


a 


On+i — 1 
(1.3) > ———— = 1, 
t=1 On41An42 °° ° An+i 
which is readily established by the following observation. 
The sum of the first & terms of the series has the value 


1 
1 —- ————_ 
On414n+2 °°° On+k 


which tends to 1 as k tends to infinity. 
We define the integers co, ¢1, C2, -++ and also a sequence 
of real numbers aj, a2, a3, ++ by the equations 


Co = [a], a =a— Co 
(1.4) 
c; = [a;,a,], Os41 = Aa; — Ci, i= 1,23, ++. 


Thus a; is the fractional part of a;_,a;_1, so that 
(1.5) 0sa; <1, t= 1,2,3,--°. 


Multiplying by a;, we get 0 S a,a; < a,, and, since c; is 
the integral part of a,a;, we conclude that 0 S ¢; S$ a; — 1, 
as required by the theorem. 
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Next we use equations (1.4) and mathematical induc- 
tion to observe that 


(1.6) 
a= b+ ay =o + — tS a cote a ea oa 


ay ay a1QQ a1Q2 


n 
CG; On+t 
=o + YD + 


i=) 20g +++ A; 142 *°* An 


We define z, as 


tn = tot DY 


i=1 412 °°: a; 
so that by (1.5), (1.6), and the hypothesis a; = 2 we have 


$2 peepee 
02 °**Gn 2” 
Thus the sequence a — 2X, tends to zero as n tends to in- 
finity, and this implies (1.2). 
Next we establish that ¢; < a; — 1 for infinitely many 
t. On the contrary, suppose that there is a fixed integer 
n so that c; = a; — 1 for alli > mn. Then we use (1.2) 
and (1.3) to obtain 


n eo 
Cj a;— 1 
a= a+ ———+ 
tl @1Qq *°* A; t=n+1 44g °** A; 
n 
t=] @jAq °° * A; 
fe 3} 
ee 5 eae 
Q1Qq **° Gn imi On414n42 °° * An4i 
> Cy I 


f 
& 
+ 

M 
+ 
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A comparison of this with (1.6) yields a,41 = 1, contrary 
to (1.5). 

Finally we establish the uniqueness of the representa- 
tion (1.2). Suppose that 


where the integers b; satisfy the same conditions as do the 
c; We must prove that b; = c; for every 7. The identity 
(1.3) implies that 


because of the condition that b; < a; — 1 for infinitely 
many 7. Hence we see that bo is the integral part of a; 
thus bo = [a]. Also co = [a] by (1.4), and so bo = e. 
Next suppose that for some n 2 1 the pair b, and c, are 
unequal. There is no loss of generality in assuming that 
nm is the smallest integer with this property, and that 
Cn > bn, so that c, — b, 2 1. Hence we have 


2 Cz = b; 
SE ay 
ton QQ °** A; t=n G42 °° Qj 


and, since these series are absolutely convergent, we may 
rearrange terms to get 


ba b; — ¢; Cn — bn 1 
= —_——~ 2 


t=n+1 41@q °++ Aj Q14q °-** An QA *°* An 


We obtain a quite different inequality on the magnitude 
of this infinite sum by again using the fact that b; < a; ~ 1 
for infinitely many 7, so that b; — c; < a; — 1 for infi- 
nitely many 7. We employ this and (1.3) to write 


10 RATIONALS AND IRRATIONALS Ch. 1 


= by — 6 = a;—1 
ian+1 202 *°* Ay jon41 @182 °°* a 
1 > Qnyi— 1 
Gydq °** On i=1 An414n42 °°* AOn4i 
1 
7 G1Qg °°° dn 


Thus we have a contradiction, and the proof of the theo- 
rem is complete. 

We remarked that Theorem 1.6 is a generalization of 
the ordinary decimal expansion of a real number a. This 
can be seen by taking all the integers a; = 10. Thus, if 
a is positive, the equation (1.2) gives the decimal repre- 


sentation 
@ 


(1.7) a=o+> Bs = C.C1C2C3 ° 
i=1 10° 

For negative a we get the decimal expansion by first get- 
ting —qa in the form (1.7) and then changing all signs, 
It may be noted that any real number a with an ambigu- 
ous decimal representation, such as a = 0.42, is given by 
(1.7) in the form with an infinite succession of zeros, thus 
a = 0.42 = 0.42000---. The reason for this is that an 
infinite succession of nines is ruled out by the condition 
of the theorem that c; < a; — 1 for infinitely many 7, a 
condition which in the present special case takes the form 
c; < 9 for infinitely many 7. 

We now give conditions which guarantee that the num- 
ber a represented by (1.2) is irrational. 


THEOREM 1.7. Let the integers a; be as described in the 
preceding theorem, and let the integers c; satisfy the inequali- 
ties of that result. Furthermore, assume that an infinite 
number of the c; are positive, and that each prime number 
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divides infinitely many of the a; Then a in (1.2) is irra- 
tional. 

Proof. Suppose on the contrary that @ is rational; say 
a =h/k. Then by the last hypothesis of the theorem we 
can choose an integer sufficiently large so that k is a 
divisor of the product a@,@2 --+ @,. Then, replacing a in 
(1.2) by h/k, we multiply by a,azg --+ a, and rearrange 
terms to get 


Q1dg +++ An(h — Cok) 2 Cijg +++ An 
k j=l G02 *°- A; 

= < Cn4i , 

t=1 In414n42 °°* An+i 


The left side is an integer, and we get a contradiction by 
establishing that the series on the right side converges to 
a value between 0 and 1. The series on the right side is 
positive by the hypotheses, and it is strictly less than 

2 Qn+~i — 1 


t=1 In410n42 °** An+i 
which has the value 1 by (1.3). 


Coro.tuary 1.8. ¢ 78 irrational. 
Proof. The standard infinite series expansion, 


os 1 1 1 
a eae apt 


satisfies the hypotheses of Theorem 1.7. 

Alternatively, we may avoid the use of Theorem 1.7 
and give a direct proof of the irrationality of e as follows. 
Assuming that e = h/k with k positive, we observe that 
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is an integer. But this becomes, when e is replaced by its 
series expansion, 


1 1 


+l) &+ DETR , 


+ GF DETDEP 


A contradiction arises because we can show that this series 
converges to a positive value less than one, and so cannot 
be an integer. For the series is strictly less than 


1 1 1 1 
@+h) &+D +s Tk 


This proof may be extended, albeit with some slight 
difficulty, to establish that e is not a quadratic irrational; 
ie., that ae? + be +c = 0 is impossible non-trivially in 
integers a, b, c. The trick at the start of the proof is to 
write the equation as ae + ce~' = —b, and to use the 
series expansion for e~! as well as that for e. We shall 
not pursue this, however, because in the next chapter we 
prove more about e in Theorems 2.10 and 2.12 with deeper 
methods. In particular, Theorem 2.12 is the generaliza- 
tion from the quadratic to the general case of the above 
assertion about the impossibility of ae? + be +c =0. 
Theorem 2.12, in turn, will be generalized in another way 
in Chapter 9, Theorem 9.1. 

We now return to the topic of decimal expansions, 
which as we have observed are a special case of Theorem 
1.6. The decimal .a;a,a3 --- is said to be periodic if there 
exist positive integers r and s such that a, = a,4, for all 
n> &. 


THEOREM 1.9. A real number a has a periodic decimal 
expansion if and only if « is rational. 
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Proof. Inasmuch as the integral part [a] of a has no 
relevance here, we may assume that [a] = 0, or that 
O0xa<1. First, if a is a periodic decimal, say 


a= 21,2 «++ Agbjbg --- b,b, bg eee bp eee, 


then the infinite decimal .b,;b2 --- b-bjbg --- b, --- is the 
fractional part of both 10%a and 10°*’a, so that 


(1.8) 10°a — [10%] = 10°*a — [10°t a]. 


Thus we can write 
(10° ta] — [10°a] 
10°+" — 10° 
which exhibits a as the quotient of two integers. 
Conversely, we assume that a is rational: say a = h/k 
with k > 0. The fractional parts of 


(1.9) a, 10a, 10%a, 10%a, --> 
are values in the finite set 
1 2 k-—1 
0, eel Mien Se Seemed 
kk k 


Hence we can find two members, say 10°a and 10°*'a, of 
the infinite sequence (1.9) having the same fractional 
parts, and this gives the equation (1.8). Define the posi- 
tive integer n by the equation 


n = [10°t"a] — [10°el, 
and then we have from (1.8) that 
10°a = n/(10" — 1). 
Define n; = [n/(10” — 1)], so that 
10° = n/(10" — 1) = nm, + n2/(10" — 1), 


where the integer nz satisfies 0 S$ ny <10"—1. This 
equation can be rewritten in the form 
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10°a — n Ba ID ie ae Hoe 
+40" © 102 * 108° 


and the infinite sum on the right side is seen to be a peri- 
odic decimal because of the inequality satisfied by the in- 
teger n2, whence 


10°a ~ ny, = bi be +++ bby be aes b, ae DS 


By adding 7, to both sides, and then multiplying by 
10°, we establish a as a periodic decimal. 


Notes on Chapter 1 


§ 1. The analytic foundation of the theory of real numbers, i.e., 
the construction of the rea] numbers from the rational numbers by 
such methods as Dedekind cuts or Cauchy sequences, is not treated 
here. Excellent formulations of this can be found in Chapter III 
of Birkhoff and MacLane, ¢ Survey of Modern Algebra, or Chapters 
III and IV of E. Landau, Foundations of Analysis. 

The word “incommensurable”’ is frequently found in the earlier 
literature. Two line segments, or two numbers, are incommensura~ 
ble in case their ratio is irrational. The origins of these words and 
concepts are delineated in O. Neugebauer, The Exact Sciences in 
Antiquity, Princeton (1952); George Sarton, A History of Science, 
Harvard (1952); and Morris Kline, Mathematics in Western Culture, 
Oxford (1953). 

§ 2. The word “‘countable” is frequently used in a broader sense 
to include also the finite sets; cf. R. L. Wilder, Introduction to the 
Foundations of Mathematics, p. 84. 

§ 4. Theorem 1.7 is due to G. Cantor, Zett. Math. Phys., 14 
(1869), 121-128; Collected Papers, Berlin (1932), pp. 35-42. Various 
extensions of this result have been made; cf. M. R. Spiegel, Amer. 
Math. Monthly, 60 (1953), 27-28; A. Oppenheim, Amer. Math. 
Monthly, 61 (1954), 235-241; P. H. Diananda and A. Oppenheim, 
Amer. Math. Monthly, 62 (1955), 222-225. 


+ Wherever in the Notes the reference to a book is incomplete as 
to publisher, edition, etc., complete information is given in the list 
of Reference Books on p. 157. 


CHAPTER 2 


SIMPLE IRRATIONALITIES 


1, Introduction. In this chapter we establish the ir- 
rationality of some well-known numbers of elementary 
mathematics. The general criteria for irrationality given 
in Theorems 1.7 and 1.9 will not suffice for these num- 
bers, and so in the next section we introduce a technique 
of Diophantine approximation. However, before getting 
into that, we set forth one more elementary criterion for 
irrationality, the Gaussian -generalization of the usual 
proof that +/2 is irrational. 


TuHroreM 2.1. If the real number x satisfies an equation 
2 + ez" 14+---+e, =0 


with integral coefficients, then x is either an integer or an 
trrational number. 

Proof. Suppose that the number z is rational, say 
x = a/b, where the integers a and b > 0 satisfy (a, b) = 1. 
Then we would have 


a” = —b(cya"~? + coa®—*b +--+ nb). 


If b > 1, then any prime divisor p of b would divide a”, 

and so p would divide a by the fundamental theorem of 

arithmetic (cf. glossary). But this contradicts the condi- 
15 
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tion (a, b) = 1, and so b = 1, which establishes that z, if 
rational, is an integer. 


The irrationality of +~/2 is the particular case of Theo- 
rem 2.1 where the equation considered is zx? — 2 = 0. 
More generally, by applying Theorem 2.1 to the equation 
x” — m = 0, we obtain the following result. 


Coroutary 2.2. If m is a positwe integer which is not 
the n-th power of an integer, then x/'m is irrational. 


2. The trigonometric functions and 7. Next we 
establish the irrationality of the trigonometric functions 
for non-zero rational values of the arguments. It will be 
convenient to have two lemmas. 


Lemma 2.3. If h(x) = x” g(x)/n! where g(x) is a poly- 
nomial with integral coefficients, then h (0), the j-th deriva- 
tive of h(x) evaluated at x = O, is an integer for j = 0, 1, 2, 

Moreover, with the possible exception of the case j = n, 
the integer h(0) is divisible by n + 1; no exception need 
be made in the case j = n if g(x) has the factor x: ie., if 
g(0) = 0. 

Proof. We may write 


h® (0) = e;(7!)/n! 


where c; is the coefficient of x’ in the polynomial x” g(z), 
so that c; is an integer by hypothesis. For 7 < n we have 
c; =0. Forj> 7 it is apparent that h%(0) is divisible 
by n+ 1. In case j = n we have h™(0) = cp, and, if 
g(0) = 0, then c, = 0. 


Lemma 2.4. If f(z) is a polynomial in (r — x)*, then 
f(r) = 0 for any odd integer j. 

Proof. For odd j, f(z) is seen to be a polynomial in 
odd powers only of (r — x), and the result follows. 
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THEOREM 2.5. For any rational number r ~ 0, cos r is 
trrational. 

Proof. Since cos (—r) = cos r, it will suffice to prove 
this for positive r = a/b, where a and b are positive inte- 
gers. Define 
a?—(a — bx)??(2a — bx)? 


(2.1) f(z) = 


(p — 1)! 
_ r= 2)?{r? — (r — 2)? Pb 
= (p — 1)! 


where p is an odd prime to be specified. Note that, for 
0O<2<r7, 


p2P{,2\P—1p3p—1—_tp—2p8P—1 
@-D! (p—-D! 

Next, using all the even derivatives of f(x), we define 

(2.3) F(a) = f(z) — f?@) +I%@) -f@ 

feeee f4P—2) (2). 


(2.2) O< f(x) < 


Thus we have 


2 {F’(x) sin x — F(z) cos x} 
dz 


= F(z) sin z + F(x) sinz = f(z) sin z, 
and 


(2.4) fre sin z dz = F’(r) sinr — F(r) cosr + F(0). 
0 


Now f(x) is a polynomial in (r — x)”, and so by (2.3) 
and Lemma 2.4 it follows that F’(r) = 0. But also f(z) 
has the form of h(x) in Lemma 2.3, with n here replaced 
by p — 1. Thus we observe that f(0) is an integer for 
every value of j, and moreover that f”(0) is a multiple 
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of p unless j = p — 1. By direct calculation from (2.1) 
we have 
fo-(0) = (2a), 

The odd prime p will be chosen to satisfy p > a, and, 
since a is a positive integer, it follows that f?—) (0) is not 
divisible by p. Thus we conclude that F(0) is an integer 
not divisible by p; say F(0) = q with (p, q) = 1. 

Continuing our examination of the right side of (2.4), 
we now study F(r). The definition (2.1) implies that 


x2? {r? ron x7}? —1p3P—! 
(p — 1)! 

xP { a? = 227}? 1p? +1 
(p — 1)! 


Thus f(r — x) has the form of h(x) in Lemma 2.3 with n 
replaced by p — 1 and g(z) replaced by 


fr - 2) = 


gPtl {a? _ b2x?}P—1pPt, 


Hence by Lemma 2.3 we see that f(r) is, for every j, an 
integer divisible by p, and so F(r) has the form pm where 
m is some integer. 

We assume that cos 7 is rational; say cos r = d/k, with 
integers d and k > 0. In view of this and our analysis of 
F'(r), F(0), and F(r), we can rewrite (2.4) as 


(2.5) Ef $0 sinz dz = —pmd + kg. 


From this we shall get a contradiction by choosing the 
prime 7p sufficiently large so that the right side of (2.5) is 
a non-zero integer, and so that the left side is a value be- 
tween +1 and —1. We have already specified that p 
should satisfy p > a, and now we add the requirement 
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p>k. Thus p is not a divisor of kg, and it follows that 
—pmd + kg is a non-zero integer. 

Turning to the left member of (2.5), we use (2.2) to 
write 
4p—2} 3p—1 43 \p—1 
et 5 
(p= 1)! (p — 1)! 
cc */(p — 1)! 


where the constants c,; = kr®b? and co = r*b® are inde- 
pendent of p. Now, as p tends to infinity, c,c3~'/(p — 1)! 
tends to zero, and so we can choose p sufficiently large so 
that the left member of (2.5) lies between +1 and —1. 
This completes the proof of Theorem 2.5. 

The above method of proof will be used several times in 
this chapter. Before proceeding, let us note the nature 
of the method. The heart of the proof lies in equation 
(2.4). Since the left member of (2.4) is arbitrarily small, 
and since F’(r) = 0, what equation (2.4) gives us is a 
very good rational approximation to cos r: namely F(0)/ 
F(r). Now, if cos r were rational with denominator k, so 
would —F(r) cos r + F(0) be rational with denominator 
k, and not zero because of the divisibility properties of 
the arbitrarily large prime p. Thus —F(r) cos r + F(0) 
would have absolute value at least 1/k, which is impossible 
since the left member of (2.4) can be made arbitrarily close 
to zero. 


Vf f(x) sin x dx| < kr 
0 


I 


3 b? 


Corouuary 2.6. x ts irrational. 

Proof. If x were rational, then cos r would be irra- 
tional by Theorem 2.5, whereas cos t = —1. 

Alternative proof. Instead of inferring the irration- 
ality of r from Theorem 2.5, we can use a straightforward 
argument which avoids some of the complications of the 
theorem. Moreover the proof which we are about to give 
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establishes the slightly stronger result that 1? is irrational. 
Define 


(2.6) (2) = 


a™(l — x)” 
n!} 


where n is a positive integer to be specified. Note that, 
for0O <2 <1, 


1 
(2.7) 0<f(r) <—- 

n! 
By Lemma 2.3, f%(0) is an integer for every value of j. 
Since f(1 — x) = f(z), it follows that f% (1) is also an in- 


teger for every j7. Assume that 7” = a/b, where a and b 
are positive integers, and define 


F(z) = b"{ x?” f(x) mt. nen? f2) (x) + gen 4 f(x) 
ee (—1)" fP” (z)}, 
so that F(O) and F(1) are integers. We see that 


a {F’(x) sin rx — x F(x) cos rz} 


dz 
= {FO (2) + x? F(z)} sin wx 
= b% 7?" +2 f(z) sin rz 
= ra” f(z) sin rz, 

and so 


1 
(2.8) a” f f(z) sin ra dz 
0 


1 


— F(z) cos x2 | = F(1) + F(O). 
0 


[= sin 72x 
v 


Now F(1) + F(O) is an integer, but from (2.7) we have 


1 ra™ 
0< raf f(e) sin rede <= <1 
0 nN: 
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for sufficiently large n. Thus we have a contradiction in 
a simpler way than was the case with equation (2.5): it 
is simpler because we know that the integral in (2.8) is 
positive. 


Corouuary 2.7. The trigonometric functions are irra- 
tional at non-zero rational values of the arguments. 

Proof. If sin r were rational for rational r ~ 0, so also 
would 1 — 2 sin? r = cos 2r be rational, contrary to Theo- 
rem 2.5. Similarly if tan r were rational, so would 


i — tan? r 


cos 2r = —————_ 
1+ tan?r 


be rational, again in contradiction to the theorem. Also 
esc r, sec 7 and ctn r are irrational since they are the re- 
ciprocals of irrational numbers. 

This proof of Corollary 2.7 establishes a little more: 
namely that sin? r, cos” r, etc., are irrational for r < 0. 

We have derived the irrationality of sin r and tan r 
from that of cos r. It is interesting that cos r seems to 
be necessarily the basic function in this process. That is, 
whereas the irrationality of sin r and tan r is implied at 
once by that of cos r, it appears to be not possible in 
such a simple way to infer the irrationality of cos r from 
that of either sin r or tan 7. This is the reason for our 
starting with the cosine function in Theorem 2.5. 


Coro.uaryY 2.8. Any non-zero value of an inverse trig- 
onometric function is irrational for a rational value of the 
argument. 

Proof. Let r be rational, and consider one of the values 
of arc cos r, say arc cos r = p, and assume that p is a 
non-zero rational number. Then cos p = 7, contrary to 
Theorem 2.5. A similar proof holds for the other func- 
tions. 
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3. The hyperbolic, exponential, and logarithmic 
functions. 


THEOREM 2.9. The hyperbolic functions are irrational 
for non-zero rational values of the arguments. 

Proof. Paralleling the proof of Theorem 2.5, we begin 
with cosh r where r = a/b > 0. We define f(x) as in 
(2.1); however, in the present proof we do not require 
that p be a prime but merely a positive integer. Equa- 
tions (2.3) and (2.4) are replaced by 


F(x) = f(z) + f(x) +f (x) + f(x) + + $6?) (2) 


and 


(2.9) f (0) sinh x dz 
= [F(x) cosh x — F’(x) sinh x) 
= F(r) cosh r — F’(r) sinh r — F(O). 


This is analogous to equation (2.4), but in the present 
case the integral in (2.9) is positive because of (2.2) and 
the fact that sinh x is positive forO <<z<r. (Knowing 
that the integral is positive enables us to give a shorter 
proof than in Theorem 2.5.) We apply Lemma 2.3 to 
f(z) and f(r — x) to conclude that f”(0) and f(r) are 
integers for every j. Thus F(0) and F(r) are integers. 
Also F’(r) = 0 by Lemma 2.4. If we assume that cosh r 
= d/k, with k > 0, then (2.9) can be written as 


k f Sedaka de Oa FO) ERO): 
0 


The right side is an integer, but by (2.2) we see that 
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piP—2p 3p —l e” ~— eT 


0 < kf se) sinh x dz < kr 
0 


@-D! 2 
322 /,7 — po? 4 p— 
_ brbi(e e ) {70°} = 
2 (p — 1)! 


for p sufficiently large. Thus we have a contradiction, so 
that cosh r is irrational. 

Next we proceed from cosh r to sinh 7 and tanh r along 
the lines of the proof of Corollary 2.7, using now the iden- 
tities 

1 + tanh? r 
cosh 2r = 2 sinh? r + 1 = ————_—_- 
1 — tanh? r 

An analogous result to Corollary 2.8 holds for the in- 

verse hyperbolic functions; we omit the details. 


THEOREM 2.10. Jf r is rational and r ¥ 0, then e” ts 
irrational. If r is rational and positive, and r ¥ 1, then 
log r ts irrational. 

The two statements are equivalent since r = e!°8" = 
log e’. Or, to say it another way, the two statements 
are related in the same way as are Corollaries 2.7 and 2.8, 
with inverse functions involved. It suffices therefore to 
establish the irrationality of e’, and we give two proofs. 

First proof. If e” were rational, so would be its recip- 
rocal e~’, and so also (e’ + e—")/2. But this is cosh r, 
and so we have a contradiction to Theorem 2.9. 

Second proof. This is a direct approach not involving 
hyperbolic functions. If e” were rational, so would its in- 
tegral powers be rational; so it suffices to consider only 
the cases where r is an integer. And, if r <0, we may 
deal with e~"; so we may presume that r is a positive inte- 
ger. Assume that e’ = a/b, with b > 0. 
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We define f(x) as in (2.6), and F(z) by the equation 
P(e) = 7" fla) — P* f'(@) + 9-2 (2) 
meee r f2"—)(z) + f2™ (2), 
so that F'(0) and F(1) are integers. We see that 


= (e F(a)} = or Fla) + PG) = He fla), 
7 


and : 
pant f e”” f(x) dz = ble’? F(z)I, =afF(i) —b F(0). 
0 


The right member is an integer, but by (2.7) we have 
pent] or (r*)* 
= bre’ —— <1 
n! n! 


1 
0< mnt f e” f(z) dx < 
0 


for n sufficiently large. Thus we have the usual contra- 
diction. 

The theorem just proved is an assertion about natural 
logarithms, and we now prove a similar result for common 
logarithms. The following simple result is stated for 
logarithms to base 10, but admits of generalization to any 
rational base. 


THEOREM 2.11. For any positive rational r, logig r is 
trrational unless r = 10” for some integer n. 

Proof. If logig r is rational, so is logo r~', and hence 
we may take r > 1. Let us write r = a/b with positive 
integers a and b satisfying (a, b) = 1. Assume that logig r 
= c/d with positive integers c and d satisfying (c, d) = 1. 
Then 10°? = a%, which implies that b = 1 and 10° = a?. 
Thus a must have the form 2%5” with positive integers u 
and v, and consequently c = ud. But (c, d) = 1, so that 
d = landr = a/b = a = a? = 10°. 
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If this theorem is generalized from logarithmic base 10 to 
any rational base b ¥ 1, it becomes almost tautological; 
thus: if rand b ~ 1 are positive rationals, then log, r is irra- 
tional unless there exist integers m and n such that r” = b”. 

We conclude with a result which, although somewhat 
out of keeping with the title of this chapter, is included at 
this point because the necessary techniques of proof have 
been fully developed. A much more general result, Theo- 
rem 9.1, is given later. 


THEOREM 2.12. e satisfies no relation of the form 
(2.10) ane™ + ame"! +:++++ aye + ap = 0 


having integral coefficients not all zero. (Stated otherwise, e 
is a transcendental number.) 

Proof. There is no loss of generality in presuming that 
ag ~ 0. We define 


2?" (x — 1)?(a — 2)?(x — 8)? --+ (29 — m)?” 
f(z) = Pn = ee 
(p=)! 
and 


F(x) = f@) + f'@) +I @ +f) 
eae porte) (gz), 


where p is an odd prime to be specified. ForO <2 <m 


we have 
P—lypPmP ..~ mP mrptp—-1 


m 
(2.11) |f(z)| < —Sane = Gop 


We readily verify that 
d 
— {e? F(z)} = e*{F'(x) — F(a)} = —e* f(z) 
dx 


and 


asf es) dz = a—e~* F(z)]é = a; FO) — ae F(}). 
8) 
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We multiply by e’, then sum over the valuesj = 0, 1, ---, 
m, and get, by use of (2.10), 


(2.12) Saye! f ‘e-* f(z) dz = — Daj FU) 
j=0 0 


j=0 
m mp+tp—l 
= ~2 Daf (p. 
t=O 


Now the application of Lemma 2.3 to f(x), f(x + 1), 
f(z + 2), ---, f(z + m), with n replaced by p — 1 in each 
case, shows that f(j) is an integer for all values of ¢ and 
j in the above sum. Even more, it shows that f(7) is an 
integer divisible by p except for the single case where 7 = 0 
and i= p-— 1. A direct calculation from the definition 
of f(z) establishes that 


JOON) (1) (52)? seam)? 


Thus f?—)(0) is not divisible by p if we choose p > m. 
Furthermore, if we choose p > |a |, we see that the right 
member of (2.12) consists of a sum of multiples of p with 
one exception, namely —apf?—) (0). Thus the sum on the 
right of (2.12) is a non-zero integer. But the left member 
of co satisfies the inequality, by (2.11), 


IA 


& ox! fe —= f(z) dz\ < 


j 

> ae [ e* f(x) dx 
0 

mptp— 


(p — 1)! 
mnpte—1 


= {Llaj|jer sry 


aan a 


< (Lali <i, 


S Dla;jle%7-1- 
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provided p is chosen sufficiently large. Thus we have a 
contradiction, and the theorem is proved. 


Notes on Chapter 2 


The irrationality of + (Corollary 2.6) was first proved by J. H. 
Lambert in 1761 by means of continued fractions. An €xposition of 
this approach to the topic is given in G. Chrystal, Algebra, vol. II, 
Chapter 34. 

The problem of “squaring the circle’ cannot be settled by the 
irrationality of x; it was solved by F. Lindemann in 1882 by the 
proof of the transcendence of x. This is discussed in § 4 and § 5 of 
Chapter 9. 

The basic technique of this chapter, and also of Chapter 9, is 
due to C. Hermite, Compt. Rend. Acad. Sci. Paris, 77 (1873), 18-24, 
74-79, 285-293. There have been various refinements and exten- 
sions of the technique by later writers. Complete references up to 
1934 are to be found in Chapter IV of J. F. Koksma, Diophantische 
Approximationen. 

The particular form of Hermite’s method used in Theorems 2.5 
and 2.9 developed from a proof of the irrationality of r given by us 
in Bull. Amer. Math. Soc., 53 (1947), 509. This was extended to 
x by Y. Iwamoto, J. Osaka Inst. Sci. Tech. (1) 1 (1949), 147-148; 
to the exponential function by Z. Butlewski, Collog. Math., 1 (1948), 
197-198, and by J. F. Koksma, Nieww Arch. Wiskunde (2) 23 (1949), 
39; and to solutions of certain differential equations by R. Rado, 
J. London, Math. Soc., 23 (1948), 267-271. Other simple approaches 
to w and tan r have been given by J. Popken, Norsk. Mat. Tidsskr., 
26 (1944), 66-70; Math. Centrum Amsterdam, rapport ZW1948-014, 
5 pp. (1948); and R. Breusch, Amer. Math. Monthly, 61 (1954), 631— 
632. 

The proof of Theorem 2.5, which we believe is new, was improved 
after discussion of an earlier version with Professor Saunders Mac- 
Lane. The unified treatment of x and e” in Corollary 2.6 and Theo- 
rem 2.10 by use of (2.6) follows Hardy and Wright, The Theory of 
Numbers, Chapter TV. The proof of Theorem 2.12 is based on that 
of A. Hurwitz, Math. Annalen, 43 (1893), 220-222, or Math. Werke, 
Basel, II (1933), 184-135. 


CHAPTER 3 


CERTAIN ALGEBRAIC NUMBERS 


1, Introduction. The trigonometric functions for ra- 
tional values of the argument or angle were discussed in 
the preceding chapter. Now we examine the values of 
the trigonometric functions for those angles that are ra- 
tional multiples of +: i.e., for angles that are rational when 
measured in degrees. Apart from trivial exceptions these 
values are irrational numbers, but they are also algebraic 
numbers whose degrees we shall calculate. We now state 
the prerequisite material needed up to § 4 of this chapter; 
one or two additional results are required in § 5 and are 
given there. 

An algebraic number is one that satisfies some equation 
of the form 


(3.1) x” + az" 14---+a, = 0, 


with rational coefficients. A polynomial having leading 
coefficient 1, such as that in (3.1), is called monic. Any 
algebraic number a satisfies a unique monic polynomial 
equation of least degree, called the minimal polynomial of 
a and the degree of a. Thus rational numbers coincide 
with algebraic numbers of degree 1. Also the minimal 
polynomial of @& is irreducible over the rational numbers, 
and it is a divisor of any other polynomial with rational 
coefficients having a as a zero, and it is the only monic 
28 
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polynomial having these properties. Proofs of these sim- 
ple properties of algebraic numbers can be found in the 
Carus Monograph of Harry Pollard, The Theory of Alge- 
braic Numbers, pp. 35-36. 

If an algebraic number a satisfies some equation (3.1) 
with rational integral coefficients, we say that a is an al- 
gebraic integer. For example, of the algebraic numbers 
1/3 and +/3/2, only the first is an algebraic integer. 
The minimal polynomial of an algebraic integer is also 
monic with integral coefficients. This can be proved by 
applying Gauss’s lemma, a form of which we now state: 
if a polynomial f(x) with rational integral coefficients can 
be factored into two polynomials with rational coefficients, 
say f(r) = g(x) h(x), then there is a non-zero rational 
number r such that r g(x) and r—! h(z) have rational inte- 
gral coefficients; for proof see Pollard, Theorem 3.7. We 
shall use later a special case of Gauss’s lemma, namely: if 
f(x) = g(x) h(x), and if f(z) and g(x) are monic with in- 
tegral coefficients, then also h(x) is monic with integral 
coefficients. 

A complex number that is not algebraic is called trans- 
cendental. We shall establish the transcendental charac- 
ter, or transcendence, of some classes of numbers in Chap- 
ters 7, 9, and 10; the number e was treated in Theorem 
2.12. 

Our purpose in this chapter is to prove the algebraic 
nature of the trigonometric functions at arguments that 
are rational multiples of +, and to determine the degrees 
of these algebraic numbers. This necessitates looking in- 
to the simplest parts of the theory of cyclotomic polyno- 
mials, a topic of interest in its own right. 

It may be noted very simply that the trigonometric 
functions are algebraic numbers at arguments that are 
rational multiples of +. First consider cos 6 with @ = 
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2xk/n, where k and n are relatively prime integers. We 
have (cos@+7sin 6)” = 1 by De Moivre’s theorem. 
Writing the binomial expansion of the left side of this 
equation, we equate the real parts to obtain a polynomial 
equation in cos @ and sin 6, the latter occurring to even 
powers only. Replacing sin? @ by 1 — cos? 6, we get an 
algebraic equation satisfied by cos 6. (This equation does 
not vanish identically: e.g., the coefficient of cos” @ is 
2-1) Also sin @ is algebraic, since sin 6 = cos (@ — 1/2). 
Furthermore, if the imaginary parts are equated in the 
expansion of (cos 6+ 7 sin 6)” = 1, and if the result is di- 
vided through by cos” @, the equation obtained establishes 
that tan 6 is algebraic. 


2. Further background material. The factorization 
of x” — 1 in the field (cf. glossary) of complex numbers, 


| 
y7>—1= Il (x = eFrikiny 
k=0 


— Qrk 2rk 
Il (2 - te = isin“), 
n 


k=0 n 


is well known, the zeros of the polynomial being the nth 
roots of unity. In §3 we shall determine for later use 
the complete factorization of x” — 1 in the field of ra- 
tional numbers for any positive integer n. Among the 
nth roots of unity a special role is played by the so-called 
primitive roots, namely e?***/" for values of k such that 
(k,n) = 1. There are ¢(n) primitive nth roots of unity, 
o(n) being the Euler function with value 


eo =n(--)0-2-G- 


where p1, Pe, -**, Pr are the distinct prime factors of n. 
If w is any primitive nth root of unity, so also is w’ for 
any exponent j satisfying (j, n) = 1. 
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For any prime p, the numbers 0, 1, 2, ---, p — 1 con- 
stitute a finite field: i.e., a field with a finite number of 
elements, addition and multiplication being defined mod- 
ulo p. (This assertion can be readily verified, in case the 
reader is unfamiliar with these concepts, from the defini- 
tion of “‘field” given in the glossary. The only point of 
any difficulty whatsoever is the multiplicative inverse of 
a non-zero element a: the congruence ax = 1 (mod p) is 
known to have a unique solution + among the values 1, 2, 
+++, p — I, and this solution is the inverse of a, denoted 
by a~!.) This finite field is denoted by J,; and by J,[z] 
we shall mean the set of all polynomials in x with coeffi- 
cients in Jj. The degree of a non-zero polynomial is the 
exponent of the highest power of x, so that the non-con- 
stant polynomials are those of degree at least one. 

We say that the polynomial g(x) in J,[z] is a divisor or 
factor of f(x) in case there is a polynomial g(x) such that 
f(z) = g(x) q(x). Any non-constant polynomial is said to 
be irreducible if it cannot be factored into two non-con- 
stant polynomials in J,[z]. Either any non-constant poly- 
nomial is irreducible or it can be factored into irreducible 
polynomials in J,[z]. One basic result which we shall 
need is that this factorization into irreducible polynomials 
is unique. The proof of this is so similar to the proof of 
the unique factorization theorem for polynomials over the 
rational numbers that we merely sketch the broad steps: 
the details are quite analogous to those given for example 
by Pollard, pp. 24-26. 

First, if f(z) and g(x) are polynomials in J,[x] of de- 
grees n and m, respectively, and if n = m, then there is 
an element c in J, such that the expression 


Se) — ea? g(x) 


is zero or a polynomial of degree less than n. From this 
we can get a Euclidean algorithm for J,[z], namely: 


32 CERTAIN ALGEBRAIC NUMBERS Ch. 3 


Given any two polynomials f(x) and g(x) ¥ 0, there ex- 
ist polynomials g(x) and r(x) such that 


f(z) = g(x) a(x) + ra) 


where r(x) = 0 or r(z) is of lower degree than g(x). Next, 
this algorithm can be used to prove that, given any two 
non-zero polynomials f(z) and g(x) having no common 
factor of degree one or more, then there exist polynomials 
s(x) and t(z) in J,[z] such that 


f(z) s(x) + g(x) tx) = 1. 


Now let m(x) be an irreducible polynomial which is a 
divisor of a product of two polynomials u(z) v(x) but is 
not a divisor of u(x) alone: then m(z) is a divisor of v(x). 
This chain of results leads to the unique factorization 
property, which we state as a lemma for reference. 


Lemma 3.1. Any non-constant polynomial f(x) in Jp[2] 
can be factored into a product of monic irreducible polyno- 
mials 

f(x) = em,(x) m2(z) +++ m,(x) 


in one and only one way apart from the order of the factors, 
where c ts the leading coefficient of f(x). 


Lemma 3.2. If g(x) is any polynomial in J,[z], then 
{g(x)}? = g(x?). 

Proof. If g(x) is a constant polynomial c, this result is 
Fermat’s theorem to the effect that c? = c (mod p). We 
can prove the general result by using induction on the de- 
gree of the polynomial g(z). Suppose that the lemma 
holds for any polynomial of degree less than n. Then, if 
g(x) has degree n, it can be written in the form 


g(x) = ax” + g;(x), 


where g(x) has degree less than n. Then we have 
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{g(2)}? = (ax*)? + (?) (a2")P—! g,(2) 


+ @ (az™)?—? {9 (2)}2 42+ {gila)}?. 


But by the induction hypothesis {g;(x)}? = gi(z”), and 


it is well known that the binomial coefficient .) is a 
Jj 


multiple of p for any integer j in the range 0 <j < p. 
Hence the above equation implies that 


{g(a)}? = aPu"? + gi (x?) = ax”? + gi(2”) = g(x"). 


Finally we state a lemma concerning the following for- 
mal type of differentiation of polynomials of J,[z]. The 
derivative of 


f(a) = ag” + az" ++++ an 
is defined as 
f'(x) = nagx™! + (n — 1)ayx”~* ++ ++ ag_y. 
Lemma 3.3. If f(x) and g(x) are any polynomials in 
J p(x], then 
{ f(x) g(x) }' = f(x) g'(@) + F'@) g(@). 


A proof of this lemma can be readily constructed, for 
example, by expanding both sides of the equation; so we 
omit the details. 


3. The factorization of x" —1. By F,(x) we shall 


mean the cyclotomic polynomial of degree ¢(n) whose zeros 
are the primitive nth roots of unity; thus 

n—l 
(3.2) P.@)= TT ee"), 

k=0 


(k,n) =1 
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The gist of this section is that the equation in the next 
theorem gives the complete factorization of z" ~— 1 into 
polynomials with rational coefficients, or for that matter 
with rational integral coefficients. 


TuHrorem 3.4. Forn 2 I, 


2” ~1= ]] Fa(z), 
d\n 


the product being over all divisors d of n. 

Proof. Any zero of x” — 1, say e?"**/", is a primitive 
dth root of unity for d = n/(k, n). Hence any linear fac- 
tor of 2” — 1 occurs in some F(x). Conversely any linear 
factor of Fa(x) is of the form x — w where w is a primitive 
dth root of unity. But any dth root of unity is also an 
nth root of unity, so that any linear factor of any F4(x) is 
a factor of z* — 1. The proof is completed by noting 
that just as the linear factors of x" — 1 are distinct, so 
are the linear factors of IIFa(x), since a primitive dth root 
of unity is not also a primitive d,th root of unity, d and d; 
being assumed unequal. 


THEOREM 3.5. Forn 2 1, F,(z) is a monic polynomial 
of degree o(n) with rational integral coefficients. 

Proof. The polynomial F(z) is monic by the equation 
(3.2), and its degree is ¢(n) since there are exactly that 
many primitive nth roots of unity. We use induction on 
n to prove that the coefficients are integers. For n = 1 
we have F;(xz) =2x—1. Assuming that all F,,(x) for 
m <n have integral coefficients, define 


2-1 
F,(z) 5 


gi(z) = H F,(z). 


dcn 


Then gi(x), a product of monic polynomials with integral 
coefficients, is monic with integral coefficients. The ap- 
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plication of Gauss’s lemma to the equation 2” —1 = 
g,(x) F,,(x) establishes that F(x) has integral coefficients. 

Next, let w be a primitive nth root of unity, and define 
f(z) as the minimal polynomial of w. As we shall see, it 
turns out that f(z) so defined and F,,(x) defined in (3.2) 
are identical. Now f(x) is monic by definition, and has 
integral coefficients since w, being a root of <” — 1 = 0, 
is an algebraic integer. 


Lemma 3.6. If p is any prime such that (p,n) = 1, and 
af p is any root of f(x) = 0, then p” is also a root. 

Proof. Since w is by definition a zero of both f(x) and 
F,(z), it follows that the minimal polynomial f(x) is a 
divisor of F,,(x), so that all the roots of f(z) = 0 are prim- 
itive nth roots of unity. Furthermore, since f(x) is irre- 
ducible, it is the minimal polynomial not only of w but 
also of each of its zeros. By the fundamental theorem on 
symmetric polynomials (cf. glossary) the pth powers of 
the roots of f(z) = 0 are roots of a polynomial equation 
which is also monic with integral coefficients, and so p? is 
also an algebraic integer. 

Let g(x), monic with rational integral coefficients, be 
the minimal polynomial of p?. We establish that g(z) = 
f(z), and from this the lemma follows at once. Suppose, 
on the contrary, that f(z) ¥ g(x). Then, being irreduci- 
ble, f(z) and g(x) would have g.c.d. 1. But g(2) is a di- 
visor of x” — 1 since they have the common zero p?, an 
nth root of unity. Similarly f(x) is a divisor of z* — 1, 
so that the product f(z) g(x) is a divisor of x” — 1, say 


(3.3) xz" — 1 = f(z) g(z) q(2). 


Gauss’s lemma implies that q(x) is also monic with ra- 
tional integral coefficients. Now p is a root of g(x?) = 0, 
so that f(x) is a divisor of g(x”), say g(x?) = f(x) Q(z), 
with Q(z) again monic with integral coefficients. 
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We now interpret these equations in J,[z]. By Lemma 
3.2 the last equation can be written as {g(x)}”? = f(z) Q(z). 
Let k(x) be any irreducible polynomial factor of f(x) in 
J,{z]. Thus k(x) is a divisor of {g(x)}?, and by Lemma 
3.1 it follows that k(x) is a divisor of g(x). Hence by 
equation (3.3), 2” — 1 is divisible by {k(x)}?, say 2” — 1 
= s(x) {k(x)}*. Upon formal differentiation of this equa- 
tion, we have, by Lemma 3.3, 


na"~* = k(x) {8'(x) k(x) + 28(x) k’(x)}. 


Note that nz"~! is not zero in J,[z] since (n, p) = 1. 
But the only irreducible polynomial factor of nx"! is z, 
so that k(x) = x, which is impossible since k(x), being a 
factor of f(z), is also a factor of x” — 1 by (8.3). This 
contradiction establishes that f(x) = g(r), and this com- 
pletes the proof of Lemma 3.6. 


THEOREM 3.7. The cyclotomic polynomial F,,(x) is irre- 
ducible over the field of rational numbers. 

Proof. We know that f(x), as defined just before 
Lemma 3.6, is irreducible since it is a minimal polyno- 
mial. We shall establish that any primitive nth root of 
unity is a root of f(z) = 0, and it follows that f(z) = 
F(x), which proves the theorem. Any primitive nth 
root of unity can be written as a power of the particular 
root w which was used to define f(x), say w’ where (t, 7) 
= 1. If the factorization of ¢ into (not necessarily dis- 
tinct) primes is t = pipe --: Pe, then (p;, 7) = 1 for all 
subscripts 7. Then, by Lemma 3.6, w”! is a root of f(z) = 
0, and by iteration of the argument w?!”? is a root of f(z) 
= 0, and so by induction w is a root of f(z) = 0. 


4. Certain trigonometric values. The reciprocal of 
a primitive nth root of unity is also a primitive nth root 
of unity. Moreover, the reciprocals of a complete set of 
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primitive nth roots of unity are the same numbers again 
in some order, and hence the polynomial F,(x) defined by 
(3.2) has the property that z?™ F,(x~!) = F,(z). Thus 
the coefficient of z’ in F,(x) equals the coefficient of 
z?™-—I for any j in the range 0 Sj < ¢(n). Let n> 2 
so that $(n) is even, say ¢(n) = 2m. Then x2” F,(z) 
has the form 


xo” F,(z) = (2 + 27”) + ay (x™) 4 a!) 
tee Om_i(@ + 2") + an, 
with integral coefficients. Now the identity 
a bee = (2 + 27) (ab! 4 gt*) — (gt? 4 2? -#) 


enables us to establish readily by mathematical induction 
that x* + 2~* is a monic polynomial in x + x! of de- 
gree k with integral coefficients. This analysis proves the 
first part of the following proposition. 


Lemma 3.8. Let n > 2, and define m = ¢(n)/2. Then 
xz" F,(z) is a monic polynomial in x+ 271, say 
¥n(z + x1), with rational integral coefficients. Also ¥,(z) 
ts irreducible of degree m. 

Proof. It is clear that ¥,(x) has degree m; so it remains 
to prove the irreducibility. Suppose contrariwise that 
Wr(z) = hy (x) he(xz), where hy and hp are non-constant 
polynomials in x, with h, say of degree r. Then we could 
write 


F,(z) 


a” p,(x + 271) 
{a hye + ")}-{2™~ hoa + z)}, 
and this contradicts Theorem 3.7. 


TaHEoreM 3.9. (D.H.Lehmer) Jf n> 2 and (k,n) = 
1, then 2 cos 2rk/n is an algebraic integer of degree o(n)/2. 
For positive n # 4, 2 sin 2nk/n is an algebraic integer of 
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degree $(n), $(n)/4, or $(n)/2 according as (n,8) < 4, 
(n, 8) = 4, or (n, 8) > 4. 

Proof. Since e?**/" is a root of F(z) = 0, it follows 
that 2 cos 2xk/n = e?™*/m 4 e~2rtk/" is 9 root of Wa(z) = 
0, and so is an algebraic integer of degree ¢(n)/2 by 
Lemma 3.8. Next we observe that 


2 sin 2xk/n = 2 cos 2r(4k — n)/4n, 


and so we investigate the lowest terms of the fraction 
(4k — n)/4n. This fraction (i) is in its lowest terms if n 
is odd, (ii) reduces to a fraction with denominator 2n if 
n = 2 (mod 4), and (iii) reduces to a fraction with de- 
nominator 7 or smaller if n = 0 (mod 4). 

In case (i) we have, letting d denote the degree of the al- 
gebraic integer 2 sin 2rk/n, d = $(4n)/2 = ¢(n). In case 
(ii) we have d = $(2n)/2 = ¢(n). Case (iii) separates in- 
to two subcases as follows. If n = 0 (mod 8) then, k be- 
ing odd, the fraction (4k — n)/4n reduces to a fraction 
with denominator n, so that d = ¢(n)/2. On the other 
hand, if n = 4 (mod 8) the fraction (4k — n)/4n reduces 
to one with denominator 7/4 in case k = n/4 (mod 4) and 
denominator n/2 otherwise, and it is readily calculated 
that d = ¢(n)/4 in both instances. 

The restriction n ¥ 4 in the sine part of the theorem is 
needed to avoid having (4k — n)/4n reduce to denomina- 
tor 1 or 2, in order that the cosine result is always ap- 
plicable. 


5. Extension to the tangent. We now extend 
Lehmer’s theorem to the case of tan 2rk/n, an algebraic 
number whose degree we compute. However, the num- 
ber 2 tan 2rk/n is not always an algebraic integer, for ex- 
ample in the case k = 1, n = 12. Our procedure necessi- 
tates the use of a couple of results on algebraic number 
fields which, though fairly elementary, are more advanced 


Sec. 5 EXTENSION TO THE TANGENT 39 


than the ideas previously employed in this chapter. We 
state these results now. Let R denote the field of rational 
numbers, and R(u) the extension of R by the algebraic 
number u. The degree of R(u) over FR is the same as the 
degree of u. The principal result required is that, if H is 
a finite extension of R, and K in turn a finite extension of 
H, then 
{K:R] = (K:H](H:R], 


where the notation [K:R] denotes the degree of K over 
R. (For proof see Pollard, p. 51.) A special case of this 
result is that, if w and v are algebraic numbers of the 
same degree, and if R(v) is a subfield of R(u), then R(u) 
= R(v). We write d(u) for the degree of the algebraic 
number w. 

Lemma 3.10. R(cos 26) is a subfield of each of the fields 
R(cos 6), R(sin 6) and R(tan 6), for any argument 6 for 
which tan 6 exists. 

Proof. This is clear from the identities cos 20 = 2 cos? 6 
—1=1-—2sin? 6 = 2/(1 + tan? 6) — 1. 

THEOREM 3.11. Forn > 4 and (k,n) = 1, the degree of 
tan 2rk/n is o(n), $(n)/2, or o(n)/4, according as (n, 8) 
< 4, (n, 8) = 4, or (n, 8) > 4. 

Proof. Case 1: (n,8) <4. Write 6 for 2xk/n. Then 
Theorem 3.9 says that 


d(cos 6) = d(cos 20) = 3d(sin 6). 
By Lemma 3.10 we conclude that 
Ricos ¢) = R(cos 26) C R(sin 4), 


and hence tan @€ R(sin 6). This with Lemma 3.10 im- 
plies that 


R(cos 6) = R(cos 20) C R(tan 6) C R(sin 6). 
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Also the degree [R(sin 6):R(cos 0#)] = 2, so that either 
R(tan 0) = R(sin @) or R(tan 6) = R(cos 6). But the lat- 
ter is impossible since it would imply that sin @ © R(cos 9). 
Hence 

R(tan 6) = R(sin6) and d(tan @) = d(sin 6) = ¢(n). 


Case 2: (n, 8) = 4. The proof parallels that in case 1, 
with the roles of sin 6 and cos @ interchanged. Specifically, 
we can write 

d(sin 6) = d(cos 26) = 3d(cos 6), 

R(sin 6) = R(cos 26) C R(cos 8), 
(8.4) tan 6 € R(cos 6). 
We continue the argument with 

R(sin 6) = R(cos 26) C R(tan 6) C R(cos 6), 
[R(cos 6): R(sin #)] = 2, 

and R(tan 6) = R(sin @) is impossible in this case since 
it leads to the conclusion cos @ € R(sin 6). Hence 
R(tan 6) = R(cos@) and d(tan @) = d(cos@) = ¢(n)/2. 

Case 8: (n, 8) = 8. In this case we will establish that 
R(tan 6) = R(cos 28), from which it follows that 

d(tan 6) = d(cos 260) = $(n)/4. 


Now R(cos 26) C R(tan @) by Lemma 3.10, and so we 
prove that R(tan 6) C R(cos 26) or that tan 6 © R(cos 26), 
which we do by induction on j, 2’ being the highest power 
of 2 dividing n. First for 7 = 3, the smallest value of 7 
to be considered since (n, 8) = 8, we note that 26 is of 
the type treated in case 2, and formula (3.4) would now 
be written as tan 20 € R(cos 26). It follows that sin 26 
€ R(cos 26), and so 
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jeg se Bey 
an SS. ee cos é 
1 + cos 26 


For any j > 3 we assume by the induction hypothesis 
that tan 26 € R(cos 46) and we must prove that tan @ © 
R(cos 20). Now R(cos 40) C R(cos 26) by Lemma 3.10, 
and consequently 


tan 26 € R(cos 26), sin 26 € R(cos 26), 


fie Riots) 
an @ = —————_ cos 26). 
1 + cos 20 


Thus the proof is complete, and we note the following 
consequence of Theorems 3.9 and 3.11. 


Coro.iary 3.12. If 6 ts rational in degrees, say 0 = 
2ar for some rational number r, then the only rational values 
of the trigonometric functions of @ are as follows: sin 6, cos 6 
= 0, +4, +1; sec 6, csc 6 = +1, +2; tan @, cot 6 = 0, 
+1. 
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CHAPTER 4, 


THE APPROXIMATION OF 
IRRATIONALS BY RATIONALS 


1. The problem. Given an irrational number a, it is 
clear that there are rational numbers h/k close to a, so 
that |a — h/k| is small. How small? Since by Theorem 
1.5 the rationals are dense, we can choose h/k so that for 
any arbitrary positive « we have |a — h/k| <e. If we 
presume k positive, this can be written as |ka — h| < ek. 
This inequality suggests, but does not solve, the problem 
of trying to select the integer k so that ka is arbitrarily 
close to an integer. Stated completely the problem is 
this: Given an irrational number a, can we find a positive 
integer & so that ka is arbitrarily close to the nearest in- 
teger; that is |ka — h| < e? The answer is yes (Theorem 
4,2), and we solve this and some related problems by a 
very simple method (the pigeon-hole method) in this chap- 
ter. Deeper results will be obtained in Chapter 6 by use 
of continued fractions. 


THEOREM 4.1. For any irrational a there exist infinitely 
many rationals h/k such that |a — h/k| < 1/k?. 

Proof. Let n be any positive integer. Consider the 
n + 1 real numbers 
(4.1) 0, a — [a], 2a — [2a], ---, na — [na], 
and their distribution in the n intervals 


j/nsz<jt+ l/r, j=0,1,2,---,n-1 
42 
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These 7 intervals cover the unit interval O < x < 1 and 
hence contain the n + 1 values (4.1). Hence two of the 
numbers (4.1) lie in the same interval; this is the pigeon- 
hole principle. Call these numbers nya — [na] and nea 
— [nga] with OS ny < ne Sn. Since the intervals are 
of length 1/n and are not closed at both ends, the differ- 
ence between the two numbers is less than 1/n; thus 


| nga _ [nga] — Na + [na] | < 1/n. 


Write k for the positive integer nz — 7, and h for [nga] — 
[nya]; so we have [ka —h| <1/n with k Sn. So far 
we have proved, then, that, given any positive integer n, 
there exist integers k and h, with n = k > 0, such that 


(4.2) |ka-—h| <1/n or |a—h/k| < 1/nk. 


The latter relation implies the inequality stated in the 
theorem, because n = k implies that 1/nk < 1/k?. 

To complete the proof of the theorem we must show 
that there are infinitely many such pairs of integers (h, k). 
Suppose, on the contrary, that there is only a finite num- 
ber, say 

(Ai, ky), (ha, ke), a) (As, k;). 


We prove that this supposition is false by finding another 
pair (h, k) satisfying (4.2). Define e as the minimum of 


Ja = hi/ky|, la — he/ke|, es | | a = h,/k,|. 


Since « is irrational, ¢ is positive. Choose n so that 1/n 
< e, and then by the first part of the proof which led to 
(4.2) we can find a rational h/k so that 


la—h/k| <1/nk S l/n<e. 


By the definition of ¢ it follows that h/k is different from 
h,/k; fori = 1, 2, ---, r, and this establishes the theorem. 


44 APPROXIMATION BY RATIONALS Ch. 4 


THEOREM 4.2. Given any irrational a and any positive 
integer n, there exist integers h and k withO <k Sn such 
that |ka — h| < 1/n. 

This result follows from the proof of Theorem 4.1, in 
particular the first relation (4.2). 


THEOREM 4.3. A real number a is irrational if and only 
if for any positive ¢ there exist infinitely many pairs of in- 
tegers (x, y) such that0 < jar — y| <.e. 

Proof. First, if @ is irrational, we can get infinitely 
many pairs (xz, y) from the pairs (k, h) of Theorem 4.1, 
using only those for which 1/k < «. Conversely, if a is 
rational, say a = a/b with b> 0, then |avt—y!| is a 
rational number for any pair of integers (x, y). Indeed 
|ax — y| is of the form u/b where u is some non-negative 
integer. Then, if we choose e < 1/b, we see that no pair 
of integers (x, y) can satisfy the inequality stated in the 
theorem. For, if (z, y) is a pair such that |az — y| > 0, 
then [ax — y| 2 1/b >. 


2. A generalization. It is natural to inquire whether 
the results obtained above can be extended to the simul- 
taneous approximation of irrationals: given a, and ae, 
can we find & so that both ka, and kag are arbitrarily 
close to integers? Again, can we find integers k, and ke 
so that the linear combination k,a; + keaeq is, in some 
specified manner, close to an integer? We shall generalize 
in both ways suggested by these questions. 

We say that (k1, ke, +--+, km) is a lattice point in m-space 
if its coordinates are integers. Given any rectangular 
array or matrix of real numbers a;; with j = 1, 2, ---, m 


andi = 1, 2, ---, n, we set up the n linear forms > a;;k;. 
j=1 


e] 
We establish that the lattice point (ki, ke, ---, km) can be 
chosen so that each of these n linear forms is arbitrarily 
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close to an integer. To say this another way, if we re- 
gard the n linear forms as coordinates of a point in n- 
space, this point can be located arbitrarily close to some 
lattice point (h;, he, ---, An) in n-space by suitable choice 
of the integers k;. This can be done in a trivial way by 
choosing each k; = 0; so we require that not all the k; 
vanish; that is we require that 2|k;| #0. We now state 
the proposition formally. 


TuHEorEM 4.4 For positive integers m and n let aj; be 
a collection of mn real numbers, with 1 = 1, 2, ---, n and 
j= 1, 2, ---, m. Let r 2 1 be any real number, and de- 
fine T = —[—7], so that T is the smallest integer not less 
than r. Then there exist lattice points (ky, ko, +++, km) and 
(hy, ho, +++, An) with (k;{ < T"/™ for all j = 1, 2, «++, m, 


and >> |k;| ~ 0, such that 
j=l 


Y asks — he <1/r for i= 1,2, -+:,n. 


j=1 


Proof. For any positive integer q there are (¢ + 1)” 


lattice points (y1, ---, Ym) with 0 S y; S q, and (¢ + 1)” 
corresponding points (w;, --+, w,) defined by 
we = Do oasys, #=1,2,---,n. 
j=1 


Let z; be the integer such that 0 S x; — w; < 1 for each 
a= 1, 2, ---, n; an alternative definition is 7; = —[—w,]. 
Thus we obtain a collection Q of (¢ + 1)” points defined 
thus, 


Q: (1 — @1, Yq — Wa, °""'; Ln — Wn), 


which are points of the unit cube in n-dimensional space. 
The unit cube referred to here is in effect defined by the 
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inequalities 0 < 2; ~ w; <1, so that it is a half-open 
cube. Divide this unit cube into T” smaller cubes of 
side 1/T by means of parallel hyperplanes, each of these 
subcubes being half-open in the same way. 

Next we set g = (T”/"] so that 


+1)" = (7) + 1)" > (T"™)™ = T*, 


Thus the (¢ + 1)” points of Q, being distributed in some 
fashion in the 7” subcubes of side 1/7, cannot all lie in 
different subcubes. At least two points of Q lie in the 


same subcube, say the two points (a1 — 1, 1°) En — Wn) 
and (t; — wi, °°", 2, — w,), Where w; = > ai;y;, and 
(Yi) ***y Ym) i is a lattice point distinct from in, -* +) Ym) 
but with 0 S$ y; Sq. Hence we have 
ce |2; — @; — 2; + a, | 
T a t % ‘3 
m 
=| D0 assuj — ys) — (@ — 2a) | $2 1,2, +++, 0, 
j=l 


These inequalities imply the conclusion of the theorem if 
we first write k; for y; — y;, hi for x; — x,;, and then use 
the fact that r < T. 

However, we must verify that the k; satisfy the in- 
equalities stated in the theorem. Since 0 S$ y; S q, and 
likewise for y;, it follows that 


ki} = |y;-— yl S¢=(7""] s T". 
Finally, since the lattice points (y,, ---, Ym) and (yi, -°-, 


Ym) are distinct, we note that k; ¥ 0 for at least one j, 
and hence 2[k;| 0. 


Corouiary 4.5. Given any real numbers a, a2, +++, Om 
and any integer t = 1, there exists a lattice point (ki, -+-, 
km, h) with |kj| S t for allj = 1, 2, ---,m, and Z|k;| ¥ 0, 
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such that 
| kyo, + kaa +- a Km&m = h| < 1/t™. 


Proof. In Theorem 4.4 we replace 7 by ¢”, n by 1, ay; 
by a;, and hy by h, so that 7 = ¢”, T"/™ = t, and the re- 
sult follows. 


TuHEoreM 4.6. Given any real numbers ay, ag, +++, On, 
there exist infinitely many sets of integers k, q1, G2, °**; Qn 
with k > 0 such that 


(4.3) ja: — qi/k| <1/(kWk) for i=1,2,+++,n. 


Proof. In Theorem 4.4 we replace m by 1, ay by aj, 
and we employ 7 only as a positive integer so that r = T. 
Thus we conclude that for any positive integer 7' there is 
a lattice point (k1, hi, he, +--+, An) with 0 < |k,{ S 7” 
such that 


lasky — hy] <1/T for i= 1,2,---,n. 


These inequalities also hold, obviously, for the lattice 
point (—k,, —h,, —he, --+, —hn); so select that one of 
this pair of lattice points having positive first member 
ky or —k,, and designate it by (k, q1, dz, -**, Qn). Thus 
we know that for any positive integer T there is a lattice 
point (k, g1, g2, ‘++, Gn), With 0 < k < TJ” such that 


(4.4) lak —q:| <1/T for i= 1,2, ---,n. 


This is equivalent to the inequalities (4.3) since we can 
divide through by the positive integer k and replace 1/T 
by the larger value 1/7/k. 

We must prove that there are infinitely many sets of 
integers satisfying (4.3). We separate the proof into two 
parts. First, if all of a1, ag, ---, @n are rational, the 
whole result becomes trivial because we can ignore all the 
above theory and simply choose k as any common inte- 
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gral denominator of the a; and then take g; = ka;. And 
there are infinitely many choices for such a k. 

Second, suppose that at least one of the a; is irrational. 
There is no loss of generality in assuming that a, is irra- 
tional. If there were only a finite number of sets of inte- 
gers k, qi, -**, Qn satisfying (4.3), then there would be 
only a finite set of corresponding values |a,k — qi|, all 
positive since they are irrational. But each of these 
values would exceed 1/7 provided we choose the positive 
integer 7 sufficiently large. This integer 7 would, by 
the procedure leading to (4.4), give us a different set of 
integers k, qi, «°°, Qn, and thus the proof of Theorem 4.6 
is complete. 


3. Linearly dependent sets. Theorem 4.6 can be 
strengthened in case the numbers a, ---, a» are linearly 
dependent over the field of rational numbers, and this we 
now do. 


TueoreM 4.7, Assume that only m of the real numbers 
@, Gg, °**, Gy are linearly independent over the field of ra- 
tional numbers. Then there exists a constant c and infi- 
nitely many sets of integers k, qi, +++, Gn with k > O such 
that 


las — qi/k| </(kWk) for 1=1,2,-++,n. 


Proof. Ifn = m, the result is Theorem 4.6 with c = 1. 
Hence we may assume that m <n, and we arrange the 
notation so that a;, ---, am are linearly independent, and 
each of am4i, °**, @, can be represented as a linear com- 
bination of these with rational coefficients. Using a com- 
mon denominator b > 0 for these rational coefficients, we 
have relations of the form, with integers a,;, 

(4.5) a= QyjQj for t= m + 1, cry Nn 


m 
=1 


1 
bj 
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Applying Theorem 4.6 to aj, --+, am, we see that there 
exist infinitely many sets of integers k’, q}, --+, Qj with 
k’ > 0 for which 


1 
(4.6) [aj — 4) coe peg Fr F= 12 erym 


For i > m we have, using (4.5) and (4.6), 


1 m 
ay > 05595 rl x Ajjaj —— bk’ =p 059; 
3 (oF) 
—_ < 
re - )< 25 


- oe" a;;|. 
We define k = bk’, g; = bg; fori = 1, ---, m, and 


(4.7) 


Gj 


ed 


™m 
qi = do aig; for i=m+1,-++,2, 


j=1 
so that there are infinitely many sets of integers k, qi, qa, 
**,; Qn. With this notation, (4.6) can be written as 


bb 
kwW/k 


ay < for j=1,--+,m, 


and (4.7) as 


Vo - 2 |a;;| 


for «=m-+1, 


az; 


| 
k | 78 


These two inequalities establish the theorem if we define 
c as the maximum of the n + 1 — m numbers 
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bv, Vb» > fasl, t=m+ ern. 


j=1 


Notes on Chapter 4 


All the arguments here are based on the pigeon-hole principle of 
Dirichlet: if there are » + 1 objects in n boxes, there must be at 
least. one box containing two or more of the objects. Theorem 4.1 
can be improved, and this is done in Chapter 6 (Theorem 6.1) by 
the use of continued fractions. 

The results, both here and in Chapter 6, are only a smal] sample 
of the work that has been done with this topic; cf. J. F. Koksma, 
Diophantische Approximationen. An excellent exposition is to be 
found in G. H. Hardy and E. M. Wright, Theory of Numbers, Chap- 
ters 11, 23, 24. 


CHAPTER 5 


CONTINUED FRACTIONS 


1. The Euclidean algorithm. Consider a pair of in- 
tegers Ug and w,, with uw, ¥ 0 and (uo, u;) = 1. The di- 
vision algorithm shows that, if uo is divided by ™, there 
is a unique quotient [up/u,] and a unique remainder, say 
U2, with O S ug < wy. If we ~ 0, the process continues 
with %; divided by ue, and in this way we get the Euclidean 
algorithm 
Ug = Uy[Up/u] + U2 


Uy = Ug[u;/uUe] + us 


(5.1) Uy = U{uiy/u] + Usqs 
Um—1 = Um[Um—1/Um) + Um 41 


Umn+1[Um/Um+i]- 


Um 


The remainders satisfy the inequality 0 < u,41 < u; for 
1sism. The last non-zero remainder, u,z41, is the 
g.c.d. of up and uw, and so by hypothesis um 1; = 1. In 
the special case u; = 1, up/u; is an integer and m = 0. 

If we write &; = u,/u;41 and a; = [é,] for 0 Si Sm, 
we can put the equations (5.1) in the form 


(5.2) &1=a@it Wi, LStSm; kn = am, 
51 
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By eliminating £ from the cases 7 = 1 andi = 2 of equa- 
tions (5.2), and then eliminating & from the result and 
the case i = 3 of (5.2), and continuing this process until 
finally £,, is eliminated, we obtain 


1 
(5.3) f = do + - a 


. 1 
+ 
an—~1 + 4 
am 


This is a continued fraction expansion of ). The integers 
a; are called the partial quotients, since a; = [u;/u,41) for 
0<i<m. Now a may be positive, zero, or negative, 
but a; 2 1 for 1 $7 Sm, since uj4, <u; Also we note 
that @n > 1 if m = 1, because an = Em = Um/Um41 and 


Um > Um4i- ; 
Notation. For any real numbers 2p, 21, -°*, Zn With 
x; > O if ¢ > 0, we shall write 
1 
(5.4) [Zo, Z1,°""; Ln] = Xo + 
21+ 
7 , 

+ ts 

Tn—-1 + ye 

In 


This finite continued fraction is said to be simple in case 
the 2x; are integers, all positive except perhaps x. For 
example, the right side of (5.3) is a finite simple continued 
fraction, and so it could be represented by the notation 
(a0, @1, +++, Onl. 

We shall make free use of the formulas 
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[Zo, T1,°""; En] Sto FS 
[z1, To, °"*; Ln] 


1 
= zo, T1) °° *y Tn—2, Ln—1 + =|. 
Zn 
This notation (5.4) for a continued fraction is in very 
minor conflict with the greatest integer notation, as fol- 
lows. Setting n = 0 in (5.4), we have [zp] = 29, and this 
is inconsistent with the greatest integer notation when- 
ever 2 is not an integer. However, this will cause no 
difficulty because our primary concern is with simple con- 
tinued fractions. In any case, the meaning will be clear 
by the context. 


2. Uniqueness. The simple continued fraction ex- 
pansion (5.3) of & is not unique, because 
(5.5) & = {ao, Gy, ***, Om—1), an] 
= [ao, Q1, °°", am—1, Im — 1, 1). 
Note that this equation is correct even if &) is an integer, 
in which case m = 0 and ado = &. We now establish that 


these are the only simple continued fraction expansions 
of any rational number £o. 


Lemma 5.1. If two finite simple continued fractions are 
equal, say [ao, a1,°""; Om] = (bo, bi, a) bn, and af am > 1 
and b, > 1, then m = nanda; = b; for0O Sis n. 

Proof. Writing y; for [b:, bi41, °--, bn], we see that 


(5.6) ys—1 = [bi-1, bs, +++, br] 


1 1 
= ba + ee 0 t+ 
[B;, Fa bn] Yi 
Hence we have y;_; > b;_1 and y;_1 > lfor2 Ssisn, 
and yn = b, > 1. It follows that b; = [y,] forO <7 < n. 
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Now if we identify yp = [bo, ---, bn] with &, we can com- 
pare equations (5.6) and (5.2). First. we have that ag = 
[Eo] = [Yo] = bo. Next we obtain 


1 
— =f —do = Yo —b = — 


& Yi 
& = 1, a = [1] = [y:] = b1. 


This process continues by mathematical induction: from 
&:-1 = Yi_1 and a;_; = b;_ it follows that 


1 1 
— = &y — Gy = Yi — a = 
g; Yi 


&=y, a; = (&] = fy] = 0; 


Furthermore we have that m =n. For suppose that 
m <n, say. The induction process would bring us to 
the results nm = Ym, Qm = bm, and these would stand in 
contradiction to ~m = Gm and Ym > bm from (5.2) and 
(5.6). The case m > n leads to a contradiction in a 
symmetrical way, and so the lemma is proved. 


THEOREM 5.2. Any finite simple continued fraction rep- 
resents a rational number. Conversely, any rational num- 
ber & can be expanded in a finite simple continued frac- 
tion in exactly two ways. 

Proof. The first remark is obvious. Concerning the 
converse, the “two ways” are given by equation (5.5). 
By using this equation to eliminate 1 as a last partial 
quotient, we then apply Lemma 5.1 to establish the theo- 
rem. 


3. Infinite continued fractions. Let bo, b;, be, --- 
be an infinite sequence of integers such that b; 2 1 for 
«21. Define the integers h, and k, by the equations 
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h2=0, hiy=1, he =bdhiathiip for 720, 
(5.7) 
k_p=1, ky =0, kj = bAks_y + h:_2 for 7 


IV 
—) 


It is clear that 1 = ko Sky <hkg < 
Lemna 5.3. If x is any positive real number, then 


Lhn_1 + hae 


fe Pyaict Bucy gl ee ee 
[bo, by 1, 2] a eee ae 


Proof. We use induction. For » = 0 the result re- 
duces to [z] = x by use of equations (5.7), where [z] here 
means the continued fraction with only one partial quo- 
tient. Ifn = 1, we have 


abo + 1 tho + hy 


bo, z] = b pia er. 
[bo, t] = bo . aera 
Assuming the result for (bo, +--+, ba—1, 2], we can write 
1 
[bo, peas bat, bn, x)= bo, a) Bn-1, ba + i 


1 

(6, + ~) hai + hn—2 
z. 
1 

(6, + ) Kn—1 + Kn-2 
z. 


_ &(bnhn—1 + Rn—2) + hint 
~ 200nkin—t + hn—a) + Fat 
_ thy + An—1 
es + Kn 
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Lemma 5.4. If we define z, = [bo, b1, -+-+, bn] for 
n = 0,1, 2, --+, then 2, = ha/Kn. 


Proof. Replace x by b, in Lemma 5.3, and apply (5.7). 


Lemma 5.5 Ford = 1 we have 


; (-1)*? 

hike_y — Aeeik: = (-1)*7? and 2; — 2;.. = -———— 
kki_1 
For 1 = 2 we have ; 

: (—1)*b; 

hikig — Aiaki = (-1)"; and 4% — 2 = : 
kikj_2 


Also the fraction h,;/k; is in lowest terms. 
Proof. From (5.7) we see that hyk_2 — h_2k_1 = 1, 
and by mathematical induction 
hike — hiky 
(bir + hi_e)kivsr — Aa (bik. + hie) 
—(hi-tki-g — hi-2ki1) = (—1)*1. 
Thus the first result of the lemma holds even for 1 = —1 
andi=0Q. The second result, the formula for 2; ~ z2;_1, 
is obtained by division by k,;k;_;, and here the restriction 
4 = 1 is needed since k_, = 0. 
Analogously we can write hok_2 — h_2ky = bo, and by 
induction 
hiki-g — hi_ok; 
= (bi + hi_e)ki-g — Ai_o(biki_a + bye) 
= bi(hiaky2 — hi_gki1) = (—1)*b:. 
Division by k,;k;_2 will then yield the final formula of the 


lemma. Finally, it is clear that h;/k; is in lowest terms, 
since any divisor of h; and k; is a divisor of (—1)'7!. 


Lemma 5.6. The values z; satisfy the inequalities 


20 < ze << % See < 25 << 23 < 24. 
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Furthermore lim 2, exists, and zo; < lim 2, < ze:41 for any 
non-negative integer 2. 

Proof. Recalling that k; is positive for t = 0, and 5; is 
positive for 7 2 1, we see by Lemma 5.5 that zo; < Zei42, 
Zai-1 > Zei41, and 2g; < 2q;-1.. Thus the monotonically 


increasing sequence Zo, 22, 24, -:* is bounded above by 2), 
and so tends to a limit. Likewise the monotonically de- 
creasing sequence 21, 23, 25, -** is bounded below by 2p, 


and so tends to a limit. These limits are equal because 
Lemma 5.5 also implies that 2; — z;_, tends to zero with 
increasing i, since the integers k,; are increasing. 

The chain of results which we have obtained in this 
section suggests the following definition of an infinite sim- 
ple continued fraction and its value. 

DEFINITION. Let bo, 01, be, --- be an infinite sequence 
of integers such that b; = 1 fori 2 i. The limit of the 
finite simple continued fraction [bo, b;, -++, bn] as n in- 
creases indefinitely is called an infinite simple continued 
fraction, with the notation [bo, bi, be, --°]. 

Thus we have 


(5.8) [bo, bi, be, «+ +] = lim [bo, by, +++, bal 


n> 0 
a ee 

= lim — = lim a, 
n>» Ky nw © 


where hy, kn, and z, are defined in (5.7) and Lemma 5.4. 
The rational number h,/k,, or in alternative notation z, 
or [bo, 61, ---, bal, is called the n-th convergent to the infi- 
nite continued fraction. 


THEOREM 5.7. Any infinite simple continued fraction 
[bo, 61, be, ---] 48 an irrational number. 

Proof. Writing @ for [bg, 61, be, ---], we see by Lemma 
5.6 that @ lies between consecutive convergents z, and 
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2n41: that is, between An/ky and hazi/kn4i. Thus we 
have, by Lemma 5.5, 


0 < | kn@ = hy} = kn) 6 — 2n| < Kalen = Zn| 
1 1 


=k, 


7 kkn41 Kn4i 


Once again we make use of the fact that the k, are posi- 
tive integers, Increasing with n. Given any positive e, we 
see that we can, by taking all sufficiently large n, find in- 
finitely many pairs h, k such that 0 < |ko—A| <.. 
Hence @ is irrational by Theorem 4.3. (This is the only 
place in the present chapter where a proof depends on a 
result from an earlier chapter. However, it may be noted 
that the dependence is on the trivial part of Theorem 4.3, 
the proof of which follows the statement of that result.) 

We now establish that two different infinite simple con- 
tinued fractions cannot have the same value. 


Lemma 5.8. Let @ = [bo, by, be, +++]. Then bo = {6}, 
where by [6] 1s meant the greatest integer not exceeding 0. 

Proof. By Lemma 5.6 we have z < 9 < 2,, which is 
the same as bb) <@< bo + 1/b;. But b; 21, so that 


bo < 6 < bo + 1, and this completes the proof. 


Lemma 5.9. With @ as in the previous lemma, and 6, = 
[b,, ba, oe ‘], we have 0 = bo + 1/6. 

Proof. Note that 6; #0 by Theorem 5.7. We can 
write 


6= lim [bo, bi, Shs b,) = lim {bo + 


nw>o n> 2 


oad 

[bi, rn | 
1 1 

a ae 


TueoreM 5.10. Two distinct infinite simple continued 
fractions represent different irrational numbers. 
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Proof. Suppose that [bo, bi, be, ---] = [ao, a1, ae, ---] 
= 6. Then Lemma 5.8 states that [6] = bp = ap. Next 
we apply Lemma 5.9 to show that [b,, be, --+] = [a1, ae, 

--].. This process continues by mathematical induction, 
so that a, = b, for all n. 


4. Infinite continued fraction expansions. The 
converse of Theorem 5.7 is also true: any irrational num- 
ber can be expanded into an infinite simple continued 
fraction. To establish this, let & be any irrational num- 
ber. Then define the sequence of integers dp, a), dz, °*-, 
and the sequence of irrationals £1, 2, &, --: by the equa- 
tions 


i 


ao = [Eo], £ = 1/(& — ao) 
a, = [&], 2 == 1/(£ — ay) 


CY 


(5.9) 


a; = [é,], figs = 1/(& — ai). 


The relation §_1 — a1 = &—1 — [§:_,], together with 
the fact that &;_, is irrational, indicates clearly that 
0 < &_1 — a;_y <1, whence & = (&:) ~ ajy_1)71 > 1 
and a; = [€,] 2 1 for all z 2 1. 


Making use of (5.9) in the form £;_, = a;_; + 1/&;, we 
see that 


1 1 
0 = Ay + — = A + ———— = [4, ay, £5]. 
fy a, + 1/t, : 
Continuation of this process by mathematical induction 
gives £) = (do, @1, Ge, ***, Q@n—1, &n]. We now identify 


the a; with the b; of the previous section; that is, we write 
ay = bo, a1 = by, etc. Then Lemma 5.3, with x replaced 
by &,, states that 


Enhn—1 + ha. 
(5.10) ko = Ido, @, °° *', Gn—1, En] = 2, 


Enk, —l + kn-2 
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The values h; and k; satisfy recurrence relations (5.7), 
which can be rewritten as 


(611)° he = aia + ya, bee Sides tees 

Thus we have, using Lemma 5.5, 

An—1 — Enltn—1 + In—2 Int 

nt Enna + hn Bent 

—(hn—thn—2 — An—akn—1) 
kn—1(Enkn—1 + kn—2) 

ee as: 

~ kn—1(Enkn—1 + In—2) 


Hence & — 2n_) is positive for odd values of n, and nega- 
tive for even values; that is 


(5.18)  22n = Ron/kan < bo < hon—1/kon—1 = Z2n-1- 
This implies that 


fo = lim zy = [dp, @1, G2, --°], 


na>2 


(5.12) & — @n—1 = bo — 


so that equations (5.9) constitute an algorithm for deter- 
mining an infinite simple continued fraction expansion of 
any irrational &. Uniqueness has been established in 
Theorem 5.10; so we can summarize as follows. 


TueEoreM 5.11. Every irrational number &) has a unique 
representation as an infinite simple continued fraction [ao, 
@1, G2, -+-], and conversely. The integers a; are positive for 
421. The n-th convergent, hn/kn, is the finite continued 
fraction (ag, 1, -*+, Qn]. The denominators ky, form a mono- 
tonically increasing sequence of integers, forn 21. The 
even convergents, 1.€., hn/kn for n = 0, 2, 4, --+, are mono- 
tonically increasing with & as a limit; the odd convergents 
are monotonically decreasing with & as a limit. 
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5. The convergents as approximations. It will be 
convenient in the sequel often to drop the subscript from 
£), and write simply £ for the irrational number with con- 
tinued fraction expansion [ap, @1, @2, --+] and convergents 
An/kn. 


Lemma 5.12. For any n 2 0, we have 
|é — halkn| < 1/knkn+1- 
Proof. We use (5.12), (5.9), and (5.11) to write 


hy 1 1 
£ _—_— SS ee < ee 
| kn kan(Enaikn + kn-1) Ka(Qn4ikn + kn—1) 
_ 1 
- Kakn+a 


Next we establish that the convergents form a chain of 
consecutively better approximations to an irrational 
number. 


TuHEoreM 5.13. The convergents hn/kn of the continued 
fraction expansion of an irrational number & satisfy, for 
n 21, 


Ble 
E a 3 oa 
|&k, J ha| < [Ekn1 = ha—1|. 


Proof. We establish the second inequality, whence the 
first is true a fortiori since the positive integers k, in- 
crease with n, except for the possibility kg = ki. By (5.9) 
we note that a, + 1> &. Hence by (5.11) we have 


Enkn—1 + Kn_2 < (dn + kn-1 + kn 


=k_t+tkhi1 S Gn4ikn + ky = Kn+1y 
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for alln 21. This with (5.12) implies that 


Ant | 1 1 
Kn-1 


= ——  —_ > 
ky-1 (Enkn—1 + kn—2) Kn—tkn41 


Multiplication by kn; gives [kn1& — Ani] > kad for 
alln = 1. But Lemma 5.12 says that |knf — hn| < kaj. 
These two inequalities establish the lemma. 

The convergents h,/k, are the best approximations to 
an irrational £ in the following sense. 


TueEoreM 5.14. If there is a rational number a/b with b 
positive such that |& — a/b| < | — ha/kn| for some n > 0, 
then b > ky. Indeed, if there ts a rational number a/b with 
b positive such that |bE — a| < |knt — hal for somen > 0, 
then b > kn. 

Proof. The second assertion implies the first, as the 
following argument shows. Assume that | — a/b| < 
| — An/ky| and that b < k,. Then the product of these 
inequalities gives |bé — a] < |k,E—h,z|, and by the 
second part of the theorem it would follow that b > ka, a 
contradiction. 

In order to prove the second part of the theorem, we 
assume that |b& —a| < |knf — A,z| and that b < k,. 
Hence b < knii, since n >0. Integers x and y exist 
such that 


b = tha + yhasi, @ = tha + yhnsi, 


because the determinant of coefficients is +1 by Lemma 
5.5. Next we argue that neither z nor y vanishes. First, 
if y = 0, then a = ch, b = rkyn, x ¥ 0, and 


[bé — a| aa [z|-|kné — hal = lkné — hal, 


contrary to hypothesis. Second, if x = 0, then y ¥ 0 and 
b = yknai, which contradicts b < kad. 
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Furthermore, x and y are of opposite signs. If y < 0, 
then the equation zk, = 6 — yk,4; shows that z > 0. If 
y > 0, then b < ykyj41 so that zk, <O andz<0. By 
(5.13) 


kn— — ha and knsif — Angi 

have opposite signs, and so 
a(knéE — hn) and Y(kn41& oa hati) 
have the same sign. It is also clear that 
bE — a = (kak — ha) + y(hn ik — hays). 
Taking absolute values, we get 
[bE — a| = [x(ken€ — hn)| + ly(kn4ié — hays)| 
> |2(kn& — hn)| 2 |kné — Anal. 


Thus we have a contradiction, and the theorem is proved. 

The assumption n > 0 is needed in the theorem. Con- 
sider the case £ = 4/3, a = 2,6 = 1. Then ho/kp = 1/1, 
and 


| — a/b] = |V3— 2] < |VW3—1| = [E— ho/kol, 
whereas b = 1 = kp. 


6. Periodic continued fractions. We say that an 
infinite simple continued fraction [a9, @1, d2, «+ +] is periodic 
if there exist integers n and s such that a, = a,4, for all 
r > s. For example, (2, 3, 4, 5, 4, 5, 4, 5, ---] is periodic. 
To evaluate such a continued fraction, write @ for the 
purely repeating part, @ = [4, 5, 4, 5, ---] = [4, 5, @]. 
Hence we have @ = 4+ (5+ 67")71, so that 6 = (10 + 
2+/30)/5. Then the original fraction is [2, 3, 4, 5, 4, 5, 
-+-] = [2, 3, 6] = (76 + 2)/(36 + 1), which is readily seen 
to be a quadratic irrational. Another example is a = 
[1, 1,1, ---] = (1 + V/5)/2, a value that can be found by 
means of the equation a = [1, a] = 1+ a7}. 
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Any periodic continued fraction is quadratic, as we now 
show by an extension of the above argument. Let us 
use the notation 


& = [Bo, bi, ++ -, Be, Qp, @1, ***, An—1, Mp, 1, °**, Gn—1; °**]; 
and write 
6 = [do, G1, -**, Gn—1, Qp, G1, °**, Qn, *°°] 

= [do, @1, ***, Gn—1, 9]. 


Then formula (5.10) gives 


_ ORn—1 + Rn—2 
Okn—1 + kno 


Thus @ satisfies a quadratic equation with integral coeffi- 
cients, so that 4 is either rational or a quadratic irrational. 
But @ is an infinite continued fraction and so is irrational, 
and hence a quadratic irrational. Returning to &, we have 


pe + p’ 
£ = [Bo, by, «++, bs, A] = rid 
go +4q 
where p/q and p’/q’ are the last two convergents to [bo, 
bi, --*, bs). Again we can argue from this last equation 


that ¢ is either rational or a quadratic irrational, and the 
former is ruled out because & is an infinite continued 
fraction. 

We have proved the first part of the following result. 


THEOREM 5.15. Any periodic simple continued fraction 
as a quadratic irrational, and conversely. 

Proof. The first part of the proof preceded the state- 
ment of the theorem; so now we consider a quadratic ir- 
rational § = & = (a+ +/b6)/c. Here b is a positive in- 
teger, not a perfect square. The integers a and c may be 
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positive or negative, and so the form given to & is per- 
fectly general. Next, may be written in the form 


ac + Vbc? 
ae ii! 


c 


to if ¢>0, or 


—ae + Vibe? . 
& — fh <0, 
—¢c 


Noting that c? is a divisor of bc? — (ac)”, we may write 
g 


_Po+ VD 
~— Q 


where Po, Qo, and D are integers, D > 0, D not a perfect 
square. 

We now establish that the infinite simple continued 
fraction expansion of é is [ag, a1, @2, ---], where the a; 
are defined by the following recursion relations: 


0 » with Qo|(D — P), 


P:+ VD 

(5.14) a;=(&], & =———— 
Q: 

D — Pix. 

Q: 
First we note that P; and Q; are integers for all 7 2 0. 
To prove this, we use induction, assuming that P; and 
Q; are integers such that Q;|(D — P?). Then P,4; is an 
integer, and the equation 


5 D— Pi, D— P? + 20,Q;P; — a7Q? 
1 >= a 


Q: Q: 


shows that Q;4; is an integer. Furthermore, we have 


Pia = a,Q; — Pi, Qi41 = 
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Q; = (D — Pi41)/Q:41, so that Q;11|(D ~— P?,,), and the 
induction is complete. Next we note that 


_Pi+ VD ~ 4 Pix aiQi + VD 


¢ + 


Q: Q: 
VD. Pia D— Py 
= FE i SS 
Q; Q(VD + P41) 
+ : thew 
= a; —_—————— a; . 
VD + Pian E44 
Qi41 


This equality together with a; = [£;] proves by (5.9) that 
£ ad [ao, 1, Ag, °° | = [ao, 1, ***y An—-1, El]. 


By f and £, we shall mean the conjugates of & and &,: 
that is, what is obtained from these when -D is replaced 
by —VW/D. Taking conjugates in (5.10), we have 


ro Enhn-1 + hn—2 


a Enkn—1 + Kn-2 
Solving for &, we get 


kn-2 (2 on ve), 
kn—1 \€o — Ihn—i/kn—1 


Now lim h,/kn = &, so that the term in parentheses tends 
to 1 as n tends to infinity. Hence for 7 sufficiently large, 
say n = N, the term in parentheses is positive and £;, is 
negative. But & > 0 for n > 0, and so & — &, > 0 for 
n2N. By (5.14) this says that 2\/D/Q, > 0, and so 
Q, >0 forn2N. But by (5.14) we have QnQn41 + 
P?,, =D. Hence, forn = N we conclude that Q, < D, 


— 
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|Pnai| < “WD. Thus the P, and Q, range over a finite 
set of values, and so there exist distinct subscripts j and 
k so that P; = P;, and Q; = Q,. Hence £; = &, and the 
continued fraction expansion [a, a1, -+-] is periodic. 


Notes on Chapter 5 


The original draft of this chapter was revised extensively, par- 
ticularly the order of the presentation and development of proofs, 
along lines suggested by Professor H. S. Zuckerman. The proof of 
Theorem 5.15 is Zuckerman’s, We have included a little more ma- 
terial than is needed for the next chapter, in order to give some 
measure of completeness to the treatment. The classic work on 
continued fractions is O. Perron, Die Lehre von den Kettenbriichen, 
Leipzig, Teubner (1929). An interesting introduction to the sub- 
ject is given in Chapter 10 of Hardy and Wright, and a more ele- 
mentary formulation in Chapter 4 of H. Davenport, The Higher 
Arithmetic. 


CHAPTER © 


FURTHER 
DIOPHANTINE APPROXIMATIONS 


1. A basic result. This section is devoted to an im- 
provement of Theorem 4.1, and this improvement is shown 
to be best possible in the next section. The last part of this 
chapter deals with the distribution of the fractional parts of 
the positive integral multiples of an irrational number. 


THEoREM 6.1. For any irrational & there exist infinitely 
many rational numbers h/k such that 
: h 1 

k| 5k? 

Proof. We make use of the continued fraction expan- 
sion [@o, 41, @e, ---] of &, and the convergents h,/k, there- 
to. The proof consists in establishing that, of any three 
consecutive convergents, at least one satisfies the in- 
equality of the theorem. (Actually the proof is not valid 
for the first three convergents, ho/ko, hi/ki, and he/ke, 


one of which satisfies the inequality nevertheless.) Write 
€n+1 for kp_3/kn, so that by (5.12) 


1 1 
- Kn (En ihn + kn—1) 7 RiEn 41 + Cnt) 


A comparison of this with the inequality of the theorem 
indicates that the question turns around the relative sizes 
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of én41 + Cn4i and ~/5. We shall assume that &; + ¢; 
< +/5 for three consecutive values of 7, namely 7 = n — 1, 
n, n+ 1, and from this will arise a contradiction, which 
will establish the theorem. 

First we note from (5.11) that 
1 kya) Gia ki-e + ki 


= — = aj_y ty. 
ci hye ki-2 


Also from (5.9) we recall that :-1 = @n—1 + 1/é, and 
we eliminate a,—, from this and from formula (6.1) in the 
case 7 = n, to obtain 
I 1 1 1 
—+— = 1+ fi S V5, — 5-—: 
Cn bn En Cn 
But & < 5 — cn by hypothesis, and, since &, > 0, we 
can multiply these two inequalities to get 


1 1 
1s6-V5(%+—): at+—s V5. 
c Cn 


Te 


The equality sign may be deleted here since ¢, is rational. 
Also c, is positive, and we use it as a multiplier to arrive 
at 


5 coal | 
a+til—- Vdc, <0, (Ye ~a) <+ 


Furthermore c, < 1 by Theorem 5.11, whence we can take 
square roots, 
V5 1 1 
— —-t <= Cn > — (V5 — 1). 
- 3 a>5 (V5 — 1) 
We have obtained this result by use of the casesi = n — 1 
and i =n of the inequality &; +c; S$ 5. The latter 
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inequality was assumed also for the case 7 = n + 1, and 
so we can also draw the conclusion that 


Cnt > (V5 — 1), Cat < $(W5 + 1). 


Using these results together with formula (6.1) in the 
case 1 = n + 1, we obtain 


an = 


Cn 41 2 2 
which contradicts Theorem 5.11. 

Three consecutive convergents were used in the proof, 
and this number cannot be reduced. For example, if & = 
(4 — +/2)7, then ap = 0, a, = 2, a2 = 1, and the con- 
vergents hy/k, and h2/ke, with values 3 and 3, fail to 
satisfy the inequality of Theorem 6.1. 


2. Best possible approximations. The constant 
+/5 in Theorem 6.1 is best possible in the sense of the fol- 
lowing result. 


TuHroreM 6.2. The constant ~/5 in Theorem 6.1 cannot 
be replaced by any larger value. 

Proof. There are values of & for which the constant 
4/5 in Theorem 6.1 can be improved, but £ = 4(./5 4+ 1) 
is not such a value, and this is the irrational which we 
use to establish Theorem 6.2. 

Assume, then, that there are infinitely many rationals 


h/k for which 
| V5+1 4h 1 
2 k| © ck? 
where ¢ is some constant exceeding 1/5. Define z, a 
function of h and k, by the equation 


V5+1 A 1 


2 ko tk? 
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so that |r| > ¢ > +/5. Then we can write 


and, squaring and simplifying, we have 


1 V5 


cee ee 
7k? x 


The right side of this equation is an integer, and so must 
the left side be. But the left side has absolute value 

1 V5 1 V5 1 V5 
el =|pe +O <p. 
a*k x xk x k c 
Now +/5/c < 1; so we can choose k sufficiently large 
that 1/k? + +»/5/e <1. For such a value of k, and the 
corresponding h, it would follow that 


W-—hk—k? =0. 


We may presume that h/k is in lowest terms, and it fol- 
lows that the last equation is impossible in positive inte- 
gers since it implies that any prime factor of k is a prime 
factor of h. 


3. Uniform distributions. Let S be a sequence of 
numbers, or points, a, a2, a3, --+, in the unit interval, 
so that 0 S a; S 1 for all z. Let J be any subinterval of 
the unit interval. The symbol J will be used to denote 
either the subinterval or the length thereof, the context 
indicating which. Let n(I) denote the number of those 
points aj, a2, a3, °**, a that lie in the interval 7. The 
sequence S is said to be uniformly distributed in the unit 
interval if for every subinterval J, 

lim my) = Tf, 


n7>o nN 


s 
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Let & be any sequence of rea] numbers, 81, Be, 63, ---, 
and recall the notation (8) = 6 — [6] for the fractional 
part of 8. The sequence F is said to be untformly dis- 
tributed modulo 1 if the sequence (61), (82), (83), «°° is 
uniformly distributed in the unit interval. 


THEOREM 6.3. If & 1s irrational, then the sequence &, 2, 
3, 4, --- ts uniformly distributed modulo 1. 

The conclusion is clearly false if & is rational; so the 
theorem could be stated as a necessary and sufficient con- 
dition for irrationality. First we give a proof by con- 
tinued fractions, and in the next section an alternative 
proof by Fourier series. 

Proof. Let any positive « and any subinterval J of 
the unit interval be given. As earlier, let n(Z) denote 
the number of those points 


(é), (2é), (3é), eae (né) 
that lie in J. We prove that 
n(I) 


maa] <e 
n 


(6.2) 


for all sufficiently large n. 

Let the convergents of the continued fraction expansion 
of £ be ho/ko, hi/k1, ---. By Theorem 5.11 the k, are in- 
creasing positive integers, and so we can locate n, if not 
trivially small, by the inequalities 


(6.3) kk, Sn < ky kh. 


This relation gives 7 as a function of n, and, as n tends to 
infinity, so do t and k;. We shall prove (6.2) for all n 
large enough so that the corresponding 7 value satisfies 


(6.4) ki < : and 2k <I. 


Sec. 3 UNIFORM DISTRIBUTIONS 73 


The idea of the proof is to work with the convergents 
h,/k; in place of —. This we do by using a subinterval I’ 
of I defined as follows: I’ is the interval I with a segment 
of length k; * removed from each end. The equation for 
lengths is thus 


(6.5) ls I — 2k,-%. 
Define g by 
(6.6) q=[n/kJ], sothat gk; Sn < (q+ 1k. 

The fraction h;/k; is in lowest terms by Lemma 5.5 so 
that the k; integers 

hy, Qhi, +++, hs 
form a complete residue system modulo k;._ Thus the points 
(mh;/k;) for m = 1, 2, +--+, k; 

are merely some arrangement of the points 

1 2 k;-—1 
ie i ae . : 
These points being uniformly spaced, we can say that at 


least k,J’ — 1 of them lie in the interval J’, the 1 being 
subtracted to allow for end effects. Next, the points 


(mh;/k;) for m = 1, 2, ---, qk; 
are the points (6.7) in some order repeated g times, so 
that, of these gk; points, at least q(k,J’ — 1) lie in the in- 


terval I’. 
We now prove that, for any value of m for which 


(mh,/k;) is a point in J’, the corresponding point (mé) lies 
in I, Using Lemma 5.12, (6.3), and (6.6), we see that 


gk; 
kikign — Keikiqa 


(6.7) 0 


< kz, 


™| = F “|< 
me — % =m EB 
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This inequality says that the distance between the points 
mé and mh;/k; is less than k; “4, and, from the definition 
of 7’, we see that mé and mh,/k; do not span an integer. 
Hence the distance from (mé) to (mh;/k;) is also less than 
k; ”, so that (mé) belongs to J. This enables us to re- 
formulate the conclusion of the last paragraph thus: that, 
of the points (£), (2), ---, (kt), at least g(k,J’ — 1) be- 
long to the interval J. But n 2 qk; by (6.6), and hence 
we have 
nZI) = g(kit’ — 1). 


Application of (6.5) and (6.6) yields, since J S 1, 
n(I) = gk’ —~q > (n— ky) (I — 2k7 %) — nk 
> nl — k; — 3nk; 4, 
m >I- ae Bkz74, 
n 
By (6.3) and (6.4) we have 
 <kt< , and 3h; < 3k < = 


and these can be applied to the previous result to give 
nl € 
ley a 


(6.8) 
n 


Denote by C the complement of J in the unit interval: 
If J is the interval (a, b), C would consist of the intervals 
(0, a) and (b, 1). The inequality (6.8), interpreted for 
C, would be written 


ney 2 


n 


C>—-., 


where n(C’) is the number of those points (é), (2£), ---, 
(nf) that lie in the intervals C. The change from ¢/2 to 
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e is to account for two intervals making up C. It is clear 
that n(J) + n(C) = n, and the lengths satisfy the rela- 
tion J+ C =1. Thus we can get rid of C in the last 
inequality, 

n- nl nt 

eae CL eg or OD pds 

n 

This, together with (6.8), implies (6.2), and the theorem 
is proved. 


Coroxuary 6.4. The number & is irrational if and only 
af the points (£), (2£), (8§), --- are everywhere dense in the 
umit interval. 

Proof. If & is irrational, the conclusion follows at once 
from Theorem 6.3. If & is rational, say § = a/b with 
b > 0 and (a, b) = 1, then the point set is discrete. It is 
in fact the finite set of points, 0, 1/b, 2/b, ---, (6 — 1)/b. 


4. A proof by Fourier analysis. We now give a 
more sophisticated proof of Theorem 6.3 by the use of 
trigonometric sums. The method, due to H. Weyl, has 
been used extensively in the theory of numbers since it 
was first introduced. This section is in effect an appendix 
to the chapter since we assume a requisite proposition 
concerning Fourier series. Thus we state the following 
result without proof; it may be found, for example, in 
Dunham Jackson, Fourier Series and Orthogonal Polyno- 
mials, Carus Monograph 6, p. 21. 


Lemma 6.5. The Fourier series for a broken-line func- 
tion converges uniformly to the function for all values of the 
variable. 

Note: By a “broken-line function” is meant a continuous 
periodic function whose graph in any one period interval 
consists of a finite number of straight-line segments of 
finite slope. 
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The next result is a special form of Weyl’s criterion for 
uniform distribution. 


THEOREM 6.6. The sequence of real numbers B,, Bo, --- 
ts uniformly distributed modulo 1 if 


1 n 
(6.9) lim — >> exp (2ihf,;) = 0 
non 


j=l 


for every positive integer h. 

Proof. Let y be any real number satisfying 0 < y < 1, 
and let J denote the interval of points ¢ such that 0 S ¢t 
<-y. Recalling the notation (8) for the fractional part 
of 8, we let n(Z) denote the number of points among (8), 


(Be), °**, (Bn) that lie in J. Assuming (6.9), we prove 
that 

I 
(6.10) lim a =¥. 


noo nN 


This result, although special in that it relates only to the 
special intervals having zero as left end point, clearly im- 
plies the general result for any subinterval of the unit in- 
terval, inasmuch as any subinterval is expressible as the 
difference of two of these special intervals with left end 
point zero. The result (6.10) is obvious in case y = 1; 
so we may assume that y < I. 
Define the periodic function g(t) by 


gt) =1 for OSt<y; gt) =0 for ysSi<1; 
gt + k) = gf) 


for any integer kK. Then we have 


(6.11) n(I) = > g(8;). 


j=l 
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Now g(é) is not continuous. In order to get two continu- 
ous functions g;(é) and go(é) which approximate g(t) both 
below and above, we proceed as follows. Let e be any 
positive real number, sufficiently small so that 2e < y and 
2e<1—y. Define the periodic, continuous, broken-line 
functions g,(t) and ge(t) (see the accompanying figure) by 
the equations: 


0 


I i eee ee 
t axis 


0 Y yte lee 1 


gté+k) = g(t) and go(é+k) = go(t) for any inte- 
ger k; 


a@)=1 for eXStSy-—6 
g(t)=0 for yStS1; 

gi(t) lmear for OStSe and y—eSisy; 
g(t) =1 for OStS 
g(t) =0 for y+eS 
g(t) linear for yStSyt+e and l—eStS 1. 


se ist. 
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Note that for all values of ¢ 
(6.12) na) S9® S geld). 


By Lemma 6.5 both g;(é) and ge(t) have uniformly con- 
vergent Fourier series expansions; say 


gi(t) = a9 + >> (a, cos 2rht + by sin 2rht), 
h=1 
(6.18) 
go(t) = co + 2a (cy, cos 2rht + d;, sin 2rht). 
h=l 


We shall need the actual values of ag and co; thus 


(6.14) 
1 1 
ee a =f wda&=1+6 
0 0 


As for the other coefficients, it will suffice to have the 
rough inequalities 


(6.15) |an| $2, |b] $2, Jel $2, [dl S 2, 


for all integers h = 1; these can be proved very simply by 
direct estimates: for example 


1 
|a,| = lof gi (t) cos 2rht dt 
0 


< 2 max |9;(¢)|-|cos 2rht| < 2. 


Now the uniform convergence of the series in (6.13) im- 
plies the existence of an integer M such that for any value 
of ¢ 
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(6.16) > (a, cos 2xht + by sin Qrht) 
h=m-+1 


for any m 2 M. 
Turning to (6.9), we rewrite it in the form 
ee bis 
lim — >> (cos 2rhB; + isin 2rh8,) = 0. 
n>o 522} 


The limit of both the real and imaginary parts must be 
zero; so there exists an integer N so that 


(6.17) 


1 n 1 n 
— cos ahd < a and |— >> sin arhe,| < 7 


N j=l N j=l 


for all h = 1, 2, ---, M and for alln = N. Now let n be 
any integer satisfying both n = N andn 2 M. Replace 
t successively by 61, 62, ---, Bn in g(t) in (6.18), and add 
the resulting equations to get, by (6.14), 


Dd 91(8;) = nly — ©) 
j=l a 
+> DO (axcos 2rhp; + by sin 2rhB;) 


j=l h=M+41 


M 
2 ( (a, cos 2rhB; + b, sin 2rhB;). 


avy 


We reverse the order of summation in the finite sums in 
the last term. Then we apply the triangle inequality and 
(6.16) and (6.17). Since both g,(¢) and y — ¢ are non- 
negative, we get 
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nm 


Dd 91(8;) 
j=l 
= 3 g1(8;) | 
i=} n oo 
Pa ny =~ €) = > > (a, cos 2rhB; + ba, sin 2rhB;) | 
j=l |h=M41 
M n n 
->dYila cos 2rhB; + by >> sin ari 
h=1 j=l j=l 
n M n 
z my 9 — De ¥ | loa «| Hs cos Bhs, 
j=l h=l1 j=1 
+ {bal «| Do sin 2xhp; 
j=l 


IV 


M 
ny — €-) — ne —  [2-S 42% 
ny — 66). ary 


A similar analysis of g2(t) leads to the conclusion that 


DX 92(8;) S nly + 6) 


j=1 
for all sufficiently large n. Next we use (6.12) in conjunc- 
tion with these inequalities to obtain 


1 n 
y — 6 S— D) 9(6;) S 7 + 66. 
N jul 


But ¢ is arbitrarily small, and hence by (6.11) 


I 12 
lim ) = lim — )) 9(6,) = 7, 


nro nN nRW>ONjua}1 


which is (6.10). 
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Having completed the proof of Theorem 6.6, we now 
deduce Theorem 6.3 from it. We must establish that 
(6.9) holds for the values 8; = jt, where & is any irrational 
number. We note that 


n 


D. exp (2rihjé) 


j=1 
|exp (2riht(n + 1)) — exp (2xihé) | 
lexp (2rihE) — 1| 
<= lexp (2rihe(n + 1))| + lexp (2xihé) | 
2 exp (2miht) — 1 
2 
~ lexp (2riht) — 1| 
where c(h, &) is obviously independent of n; also note that 


[exp (27tht) — 1| ¥ Osince hf is irrational. Hence we ob- 
tain 


= c(h, &), 


1} 2 h 
i | Sew Gap (eo 


n>o MT | 51 n> 0 n 


= 0, 
which implies (6.9) with @; replaced by jé. 


Notes on Chapter 6 


Theorems 6.1 and 6.2 are due to A. Hurwitz, Math. Annalen, 39 
(1891), 279-284. These results have been extended to the complex 
case as follows by L. R. Ford, Trans. Amer. Math. Soc., 27 (1925), 
146-154. We say that a = a + bi is a complex irrational number 
if not both @ and 8 are rational, and that x = 21 + zgt is a complex 
integer, or Gaussian integer, in case both x and ze are rational inte- 
gers. If @ is any complex irrational number, then there exist infi- 
nitely many pairs x, y of complex integers such that 


«~! <a 
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This result is best possible in the sense that the constant +/3 cannot 
be replaced by any larger value; for example, a = (—1 + i+/3)/2 
is a critical number requiring the ~/3 in the above approximation, 
in the same way that (1/5 + 1)/2 served as a critical number in 
the proof of Theorem 6.2. Ford’s proofs are along geometric lines 
quite different from the Hurwitz method of continued fractions, 
which does not seem to generalize directly to the complex case. 

Theorem 6.3 was proved independently at about the same time 
by P. Bohl, W. Sierpinski, and H. Wey]; for details see J. F. Koksma, 
Diophantische Approzimationen, p. 92. Theorem 6.3 implies the 
following result of Kronecker: given any irrational 0, any real a, 
and any positive ¢, there exist infinitely many pairs of integers n, 
m such that 

|n8 —m—al<e 


A more general form of Kronecker’s theorem is given in Hardy and 
Wright, Chapter XXIII. Generalizations of both Kronecker’s theo- 
rem and Theorem 6.3 may be found in papers of H. Weyl, Math. 
Annalen, 77 (1916), 313-352, and J. G. van der Corput, Acta Math., 
56 (1931), 373-456. 

The work of Wey] just cited is the source of Theorem 6.6. The 
converse of this theorem also holds; that is, the condition (6.9) is 
necessary as well as sufficient for the uniform distribution of the 
6’s. This converse proposition, and the fairly easy proof thereof, 
were not included in the chapter because they are not relevant to 
our purposes. 


CHAPTER % 


ALGEBRAIC AND 
TRANSCENDENTAL NUMBERS 


I. Closure properties of algebraic numbers. Alge- 
braic and transcendental numbers were defined at the be- 
ginning of Chapter 3, and a few basic ideas were outlined. 
It is customary to separate the complex numbers into the 
types algebraic and transcendental, whereas it is the real 
numbers that are classified as rational and irrational. 
These usages are readily extended, of course. We would 
say that a + bz is a complex rational if both a and b are 
rational. Likewise a real algebraic number is simply one 
that is simultaneously real and algebraic. If a complex 
number is algebraic, must its real and complex parts be 
algebraic? The answer is yes, and to establish this we 
first prove the basic closure properties of algebraic num- 
bers. 


Lemma 7.1. Any r+ 1 linear forms in r indeterminates 
with rational coefficients are linearly dependent over the ra- 
tionals. 

Remark. ‘This is a well-known result, and so we give 
a reference but not a proof. The matrix of the linear 
forms cannot have rank greater than r, and this is less 
than the number of forms. For further details, see any 
book on the theory of equations, for example, J. M. 
Thomas, Theory of Equations, Theorem 27.7, p. 49, Mc- 
Graw-Hill. 

83 
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THEOREM 7.2. Algebraic numbers are closed under addi- 
tion and multiplication; in other words, the sum and the 
product of two algebraic numbers are themselves algebraic. 
More generally, algebraic numbers form a field. 

Proof. Let a and 8 be algebraic numbers of degrees 
m and n, respectively. Thus a satisfies an algebraic equa- 
tion of degree m, say 


(7.1) a” = an_:0"—! + an_oa’™? +-+-+ aja t+ ao, 


with rational coefficients a;. Thus a” is a linear combina- 
tion of 1, a, a’, ---, a”! with rational coefficients. The 
same is true for a” *!, as can be seen by multiplying equa- 
tion (7.1) by a and then using (7.1) to replace the term 
Qm—a”" by terms in a of lower degree. This process can 


be repeated, and so by mathematical induction we see 


that each of a”, at}, a™*?, -.- is expressible as a linear 
combination of 1, a, -+:, a”! with rational coefficients. 
Similarly, each of 8”, 6"*', "*?, --- is expressible as a 
linear combination of 1, 8, 8”, ---, 8") with rational co- 
efficients. 


Now consider the mn + 1 numbers 
(7.2) 1, Qa + B, (a + g)?, a) (a + py”. 


Expanding these and replacing the mth and higher powers 
of a by the lower powers, and likewise the nth and higher 
powers of 8, we see that these mn + 1 numbers can be 
written as linear combinations of the mn numbers 


ap, j3=0,1,---,m—1, k=0,1,-:-,n—1, 


with rational coefficients. These mn numbers can be sub- 
stituted for the r indeterminates of Lemma 7.1, and so we 
conclude that the numbers (7.2) are linearly dependent 
over the rationals. That is, a + 8 is an algebraic number. 

The proof for the product ef is analogous, the powers 
of a + @ in (7.2) being replaced by the powers of a. 
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It is clear that algebraic numbers are also closed under 
subtraction and division, except division by zero. For, if 
a satisfies f(x) = 0, a polynomial equation of degree m, 
then —a@ and a7! satisfy f(—z) = 0 and x” f(1/z) = 0, 
both polynomial] equations. 


Corotuary 7.3. For real values a and b, the number 
a + bi is algebraic if and only tf a and b are algebraic. 

Proof. The number 7 is algebraic since it is a root of 
x? +1=0. Hence, if a and b are algebraic, so is a + bi 
by Theorem 7.2. Conversely, if a + bi is algebraic, say 
f(a + bt) = 0 where f(x) is a polynomial with rational co- 
efficients, then a — 07 is also a root of f(x) = 0. Hence 
the sum 2a and the difference 207 of a + b7 and a — bi 
are also algebraic numbers, and so, multiplying by the al- 
gebraic numbers 1/2 and —7/2, we see that a and 6 are 
algebraic. 


2. A property of algebraic integers. It was estab- 
lished in § 3 of Chapter 1 that the rational numbers are 
everywhere dense. Since the rational numbers form a 
subset of the real algebraic numbers, it follows that these 
also are everywhere dense. Indeed we can say much more. 


TueoremM 7.4. The real algebraic integers of any speci- 
fied degree n = 2 are everywhere dense on the real line. 

Proof. Let a and @ be any two distinct real numbers: 
say with a < 8. We must prove that there is a real alge- 
braic integer y of degree n such that a<y<. First 
note that 


(x + 8)” — (@ + a)” = {(@ + a) + (B — a)}” 
— (t+ a)" > n(x + a)"“"(B — a) 


for all z such that x + a > 0. The above inequality was 
obtained by discarding terms past the second in the bi- 
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nomial expansion. Now the value of n(z + a)""1(8 — a) 
can be made arbitrarily large by choosing x large. Hence 
there is a positive integer 7 such that 


G+ 6)” — G+ a)" > 5, jta>0, j+8>0. 


Thus the open interval from (j + a)” to (7 + 8)” con- 
tains at least four consecutive positive integers, and so in 
particular a positive integer of the form 2+ 4k. By the 
continuity principle there is a real number y such that 
(G+ 7)” =2+ 4k, witha <y< 8. It follows that the 
real number y is an algebraic integer, since it is a root of 
the monic polynomial equation 


Se) = @ + 9)” — 2(1 + 2k) = 0, 


whose coefficients are rational integers. 

To complete the proof we must establish that the degree 
of the algebraic integer y is n: i.e., that the polynomial 
f(x) is irreducible over the rationals. This is equivalent 
to proving the irreducibility of f(e — 7) = x” — 2(1 + 2k). 
The simplest way of handling this would be to cite the 
Eisenstein irreducibility criterion (cf. glossary), but the 
following direct approach is easy enough. The zeros of 
the polynomial f(z — 7) are the complex numbers 


W/2(1 + 2k) £4, s= 1,2, +++, 2, 


where ¢ is a primitive nth root of unity. Assuming that 
f(x — j) were reducible, it would have a polynomial fac- 
tor g(x) with rational coefficients, say of positive degree 
w <n. Then g(x) would be the product of w linear fac- 


tors of the form 
xz — W221 + 2k) +t. 


Consider the absolute value of the constant term of such 
@ polynomial. It would be {2(1 + 2k)}“/", since the ab- 
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solute value of ¢* is 1 for any value of s. But under our 
assumption this would be rational, say a/b, 


{2(1 + 2k)}¥/" = a/b or 2”(1 + 2k)”b" = a”. 


The latter equation is impossible, because the power of 2 
dividing the integer on the right cannot equal the power 
of 2 dividing the left side. 


3. Transcendental numbers. In § 2 of Chapter 1 
it was proved that the rational numbers are countable. 
This is now extended to the algebraic numbers. 


THEOREM 7.5. The set of all algebraic numbers is count- 
able; likewise the set of all real algebraic numbers. 

Proof. For our purposes here we think of any algebraic 
number as satisfying some polynomial equation 


f(a) = agz” + ayz™— +--+ + an = 0 


with rational integral coefficients, a9 = 0. There is no 
loss of generality in taking a9 2 1. Define the index of 
the polynomial f(x) to be the positive integer 


n+ do + [ai| + Jae] +-+-+ la,|. 


Since n 2 1 and ap 2 1, the index is at least 2 for any 
polynomial. In fact there is only one polynomial of in- 
dex 2: namely x. This gives the algebraic number 0 as a 
root of the equation f(z) = x = 0. The polynomials of 
index 3 are x7, x + 1,xz— 1, 2x. These give the new al- 
gebraic numbers +1. Similarly the polynomials of index 
4 give new algebraic numbers +2, +4, +7. Clearly there 
is only a finite number of polynomials of any given index, 
and so only a finite number of algebraic numbers given 
thereby. By allowing the index to run through the natu- 
ral numbers, and listing the new algebraic numbers ob- 
tained at each stage, we would have enumerated a se- 
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quence of distinct algebraic numbers. Furthermore, all 
algebraic numbers would be in the sequence, because every 
polynomial has an index. Thus the algebraic numbers 
are countable. If at each stage we had listed only the 
real algebraic numbers, we would obtain the second con- 
clusion stated in the theorem. 


Corotuary 7.6. Almost all real numbers are transcen- 
dental. 

Proof. By Theorem 7.5 the real algebraic numbers are 
countable, and so have measure zero by Theorem 1.3. 
The corollary follows at once, by the definition of ‘almost 
all.” 

This corollary can be readily generalized to the com- 
plex case: almost all complex numbers are transcendental. 
To give meaning to this, we must extend the definitions 
of ‘‘measure zero” and “almost all’ to the complex case. 
A set in the complex plane is said to have measure zero if 
it can be covered with neighborhoods (circles, for exam- 
ple) of arbitrarily small total area. Thus any countable 
set has measure zero, by an argument analogous to that 
in the proof of Theorem 1.3. Almost all complex numbers 
are said to have property P in case the set of complex 
numbers (or points in the complex plane) which lack the 
property P has measure zero. 


4. The order of approximation. A real number £ is 
said to be approzimable by rationals to order n if there ex- 
ists a positive constant c depending only on & such that 
the inequality 


(7.8) os 


has infinitely many rational solutions h/k with k > 0 and 
(A, k) = 1. 
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If a number is approximable say to order 5, then it is 
a fortiori approximable to any smaller order, 4, 3, 2, or 1. 
Theorem 4.1 implies that any irrational is approximable 
to order 2; in this case the constant c of (7.3) would be 1. 
The same conclusion can be drawn from the stronger 
Theorem 6.1; this theorem would enable us to improve 
the constant ¢ to 1/+/5, but from our present viewpoint 
this improvement is immaterial. This result for irra- 
tional numbers should be contrasted with the following 
proposition on rationals. 

TxHreoreM 7.7. Any rational number ts approximable to 
order 1, but not to any higher order. 

Proof. Consider any rational number a/b with (a, b) 
= 1,621. It is well known from elementary number 
theory that there are infinitely many integral solutions of 
the equation az — by = 1. In fact, if x = x, y = Yo is 
one such solution, then the general solution is z = zo + Dt, 
y = Yo + at, where ¢ ranges over all integers. Whatever 
the values of xp and 8, it is clear that xo + bt is positive 
for infinitely many values of the integer t. Thus there are 
infinitely many integral solutions z, y with z > 0 to the 


formulas 
2 


x 


1 a sy 

b xl ba b 6 

This inequality can be interpreted as saying that the ra- 

tional a/b is approximable to order 1, for it is inequality 
(7.3) with n = 1, § = a/b, h/k = y/z, and ¢ = 2. 

On the other hand, for any fraction y/z * a/b we see that 


a y|_ [roll 
~ br 


b bz 
There is no constant c such that 1/bz < c/z? for infinitely 
many integers z, and so a/b is not approximable to order 
higher than 1. 


az — by = 1, 
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Theorem 7.7 can be used to prove the irrationality of 
such a number as 


£= 1071+ 10°? + 10-* + 107-8 +--+ 10-7" +.-., 


in the following way. Define the rational numbers 7, by 
means of 


tm = 1071 + 107? + 107-* + 1078 +---+ 1072”, 


so that 7m can be thought of as a fraction with denominator 
102". Then we can write 


gmt2 


+ 107 


gm ti 


l& — rm| = 107 fine pape 


= 2(10-2”)?, 


This can be interpreted as an inequality of the form (7.3), 
with h/k = rm, n = 2, k = 10", c = 2. Since there are 
infinitely many 7, as m runs through the natural numbers, 
& is approximable to order 2, and hence is irrational by 
Theorem 7.7. The argument just completed will be gen- 
eralized after Theorem 7.8, following the procedure of 
Liouville, to exhibit a class of transcendental numbers. 


THEOREM 7.8. <A real algebraic number of degree n is 
not approximable to order n +- 1 or higher. 

Proof. Theorem 7.7 covers the case n = 1, and so we 
take n = 2. Let the algebraic number é of degree n sat- 
isfy the equation, irreducible over the rationals, 


f(x) = agz™ + ayz"™ 1 +---+ a, = 0, 


with integral coefficients. For any x in the range & — 1 
<a <&-+1 we have |z| < || + 1, and hence the de- 
rivative f’(x) satisfies 
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(7.4) 
[f’(z)| = |magx"~! + (mn — 1)ayx"—? +---+ an_| 
S |nage* | + [(m — Vaye™?| + +--+ fans! 
< nlao| {]é]+1}" + (m — Ila] {]e]+1}"? 
feet |@n—1| = A. 


This equality defines the constant A. Now, if h/k is any 
rational approximation to £, we may presume that & — 1 
<h/k<€+1. Also f(h/k) #0 since otherwise f(x) 
would have the factor x — h/k, contrary to the irreduci- 
bility of f(z) over the rationals. It follows that 


ae Ss [aoh” + ayh™ "hk +--+ ank” | ~ 1 


By the law of the mean we have 


(7) =1(3) -s@ = G- E)r@) 


for some x between h/k and & Taking absolute values 
in this equation, we apply the last inequality and (7.4) 
to get 


k” can k”™ 


h | _ F/I 1 


ke f’@)| ~ Ak 


There is no constant c so that 1/(Ak") < c/k"* for infi- 
nitely many positive integers k, and consequently & is not 
approximable to order n + 1 or higher. 


Derinition. A real number ¢ is said to be a Liowville 
number if for every positive integer m there is a rational 
number hp/km with kp, > 1 such that 


(7.5) lé = In| Kern | < (km). 
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THEOREM 7.9. Any Liouville number is transcendental. 

Proof. Assume that a Liouville number é is algebraic 
of degree n. For all integers m 2 n+ 1 the inequality 
(7.5) would imply that 


|é =o Rn/km| < Gay = 


and thus £ would be approximable to order n+ 1. But 
this contradicts Theorem 7.8. 

Theorem 7.9 can be used to construct transcendental 
numbers. For example, consider 


& = 107! + 1077? +---+ 107-7! +--+ = 0.1100010---, 


and define hm/km as the sum of the first m terms of the 
infinite series, so that km = 10". Then we have 


lf _ hin/ Km | = 197+)! + 197 +2)! aes oa 
< 2-197@tD! < (10"')-™ 
= (km)~™. 


Thus the conditions (7.5) are satisfied, and & is transcen- 
dental. 

This single number £ can be readily extended into a 
whole class of transcendental numbers. For example, we 
could employ the series 


a; 107"! + a, 107?! + a3 1073! + --., 


where each a; is either 1 or 2. 


Notes on Chapter 7 


Theorem 7.2 implies that the real algebraic numbers form a field 
also, Algebraic integers are closed under addition and multiplica- 
tion, but this cannot be concluded from the proof of Theorem 7.2 
as given; see Pollard, pp. 58-60. 
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The material in § 3 is essentially the work of G. Cantor, that in 
§ 4 the work of J. Liouville. Liouville’s work was the earlier, in 
J. Math. Pures Appl. (1), 16 (1851), 183-142; this paper gives the 
first occurrence of transcendental numbers in mathematics. Lucid 
introductions to the Cantor theory and the theory of sets generally 
are given in E. Kamke, Theory of Sets, and R. L. Wilder, Introduc- 
tion to the Foundations of Mathematics. 

Much more is known about the approximation of algebraic num- 
bers than is indicated in Theorem 7.8. The following theorem of 
K. F. Roth, Mathematika, 2 (1955), part 1, no. 3, 1-20, is a refine- 
ment of earlier results by F. J. Dyson, C. L. Siegel, and A. Thue. 
If é is an algebraic number, not rational, and if there are infinitely 
many rational numbers h/k such that 


Jeter 
k , 


then » S$ 2. This inequality is best possible, in view of Theorem 4.1. 
There is an account of the Thue-Siegel result mentioned above 
in L, E. Dickson, Introduction to the Theory of Numbers, University 
of Chicago Press, 1929, Theorems 108 and 111. This account also 
gives (Theorem 107) the following result of Thue which is implied 
by his approximation theorem: for any integer c the Diophantine 
equation H(z, y) = c has at most a finite number of solutions in 
integers if H(z, y) is a homogeneous polynomial of degree n = 8, 


H(z, y) = aov™ + aye” ly + agr??y? +--+ + any”, 


with integral coefficients such that H(z, 1) is irreducible over the 
field of rational numbers. 


CHAPTER § 


NORMAL NUMBERS 


1. Definition of a normal number. To put the 
matter roughly at first, a normal number is one in whose 
decimal expansion all digits occur with equal frequency, 
and in fact all blocks of digits of the same length occur 
with equal frequency. For example, if the base is 10, then 
such a specified digit as 7 occurs with frequency 107, and 
any triplet of digits such as 357 occurs with frequency 
107%. As we shall see, almost all numbers have this prop- 
erty, which somewhat justifies the use of the adjective 
“normal.” 

We now formulate the definition exactly. Let z = 


.% %2X%3 ‘++ be an infinite decimal to base r, and let X, 
denote the block of digits zirz --- z,. Thus each digit 2; 
has one of the values 0, 1, ---, 7 — 1. For any particular 


value 6 among these r possibilities, let N(b, X,) denote 
the number of occurrences of b in the block X,. For ex- 
ample, if X, = 123113, then N(1, Xs) = 3, N(2, Xe) = 
1, N(8, Xe) = 2, N(O, Xe) = 0, etc. To illustrate fur- 
ther the meaning of this notation, we mention that 


r—l 


> Nb, X,) =n. 


b=0 


The number x is simply normal to base r if 
94 
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(8.1) lim ' vO, X,) = : 

n >a r 
for each of the r values of b. Thus z is simply normal to 
base r if each digit b occurs with frequency 1/r. 

The number z is normal to base r if each block By, of k 
digits occurs with frequency 1/r*. To spell this out in 
more detail, we let N(By, Xn) denote the number of oc- 
currences of the block B, = bibe --- b, in Xn. Then z is 
normal to base r if 


1 
(8.2) lim —N(B;,, X,) = r* 
nao? 
for all By, k = 1, 2,3, +--+. 


Finally, if y is any real number, y is said to be normal to 
base r if y — [y] is normal to base r. 

These definitions are in no way invalidated by the am- 
biguous decimal representation of certain rational num- 
bers. For example, consider the number y = 0.03000: -- 
= 0.02999--- to base 10. This particular number y is 
not simply normal to base 10, and so not normal to base 
10, regardless of which of its two decimal expansions is 
examined. Likewise, any number that has two decimal 
expansions to base r is not simply normal to that base. 

An example of a number that is simply normal but not 
normal to base 10 is given by 


0.01234567890 1234567890123456789 ---. 


In fact, since any rational number has a periodic decimal 
expansion whatever the base, it follows that normal num- 
bers are irrational. 

The above definition of a normal number is equivalent 
to the original definition of E. Borel, according to whom 
z is normal to base r if each of z, rz, rz, -++ is simply 


96 NORMAL NUMBERS Ch. 8 


norma! to all of the bases r, r?, 73, ---. To formulate this 


Borel definition as a set of equations, let us break N(B;, 
Xp) into k parts 


k 
(8.3) N(Be Xn) = DD Ns(Bu, Xn); 
&=l 


where N,(B,, X,) denotes the number of occurrences of By, 
in Xn, say by = 2;, be = 2j41, etc., which satisfy the con- 
dition j = s (mod k). For example, let Xg = 112312121, 
and we have N(12, X9) = 3, N,(12, X9) = 2, Ne(12, Xq) 
= 1, N(121, Xo) = 2, Ny(121, Xs) = 1, No(121, Xe) = 1, 
N3(121, Xs) = 0. 

Borel’s definition is equivalent to saying that x is nor- 
mal to base r if 


] 
(8.4) lim —N,(Bz, Xn) = k71r-* forall k, s, By. 
nron 


The proof of the equivalence of (8.2) and (8.4) will be 
given in §3. However, in view of (8.3) the implication 
is clear in one direction, and we now state this formally 
for reference. 


Lemna 8.1. If x satisfies (8.4), then x satisfies (8.2). 

8. S. Pillai established that the conditions (8.4) are re- 
dundant in part, and proved that x is normal to base r if 
z is simply normal to bases r, r?, r?, ---. That is, x is 
normal to base r if 


1 
(8.5) lim —N,(B,, X,) = kT 1r7* 
n >on 
for all By, k= 1,2,3,--:. 


That this is equivalent to our definition (8.2) will be es- 
tablished in § 3. 

It will be convenient to establish some simple lemmas 
for application later in the chapter. 
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Lemma 8.2. Let fi, fo, --+; fm be real functions of the 
real variable x satisfying the conditions 


lim {f1(z) + fo(x) +-+++ fm(z)} = 1, 
and 


1 
lim inff,(z) =— for 1=1,2,-++,m. 
27 0 m 


Then 


lim fi(z) = = for *=1,2,---,m. 
a m 

Proof. We prove the result for the case + = 1, and 
this involves no loss of generality because of the complete 
symmetry in the m functions. By standard results from 
function theory (for example cf. L. M. Graves, Theory of 
Functions of Real Variables, p. 62, Theorem 14), we see that 


1 = lim sup X fi(2) = lim sup f(z) + lim inf > fila) 


t=1 t=2 


> lim sup fi(z) + > lim inf f,(z) 
i=2 
m— 1 
2 lim sup f(z) + ——- 
m 


Thus we have lim sup f;(z) S$ 1/m S lim inf f,(2), and it 
follows that lim f;(z) = 1/m. 


Lemma 8.3. The union of a countable collection of sets 
of measure zero is a set of measure zero. 

Proof. Let the countable collection of sets of measure 
zero be denoted by Sj, Se, S3, ---. For any given posi- 
tive e, the points of S,; can be covered by a collection of 
intervals of length «/2, the points of S, by a collection of 
intervals of length «/4, and in general the points of S, by 
a collection of intervals of length e/2”. Thus the union 


98 NORMAL NUMBERS Ch. 8 


of the sets S, can be covered by intervals of total length 
not greater than 2/2” = e. 


2. The measure of the set of normal numbers. 
We now establish that almost all real numbers are normal 
to every base. 

In the previous section we used the notation X, to 
designate a fixed block of n digits. It is now convenient 
to regard X, as denoting any of the r” blocks of n digits 
to base r. For a fixed digit b define p(n, 7) as the number 
of blocks of digits X, = x12 --+ tm having exactly 7 oc- 
currences of the digit b, that is, having N(b, X,) =. 
Taking 0 Sj Sn, we see that 


n! 
— = nF 
(8.6) Pm d) = sD 
It is convenient also to define p(n, 7) = 0 in case 7 < 0 or 
j>n. 


Lemma 8.4. If 0 <a <1, then 1 — a < exp (—a). 
Proof. We use the series expansion of the exponential 


function to write 
ae 8 


exp (—a) = Lee >l-a. 
Lemma 8.5. Forj 2 2 we have 
p(nr, n+ Jj) <7" exp {—j?/4nr}. 
Proof. From (8.6) and Lemma 8.4 we see that 
p(nr, n + J) 
p(nr, n) 


_ (nr — n(nr — 2 — 1) ++: (ar ~n-j +1) 
(nF Dn $2) ++ Mt Ner — 1 
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HP OP OT 


on (r-1-2)... (r= 1-2) 
————— 
9 (r-1--).-(p-1-2) 
(r — 1)? 
=a ravi wont 
ron) 
< emp | = a Js eke | 


exp {—j(j — 1)/2n(r — 1)} < exp {—j°/4nr}. 


Also we have the rather obvious relation 
(8.7) p(nr,n) < Dy p(nr,j) = 1", 
j=0 
and the lemma follows from these inequalities. 
Lemma 8.6. For j = 2 we have 


p(nr,n — j) <1" exp {—j7?/4nr}. 
Proof. Lemma 8.4 and formula (8.6) imply that 
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p(nr,n — J) 
p(nr, n) 
aN) Gt DG = VF 
= (nr ~n+ 1)(nr —n +2) --- mr—-nt+y 


ae Cr Ce) 


(r— 1) 
(yr — 1+ 1/n)\(r — 14 2/n) +--+ 7-1 4+35/n) 


saga) a) 


n n 
= exp {—j(j — 1)/2n} < exp {—j?/4nr}. 
The lemma follows from this and (8.7). 


x 


Lemma 8.7. Let e > 0 be given. For all sufficiently large 
n the number of blocks X,, such that 


(8.8) |N(b, Xnr) — n| > ne 


is less than (nr)r" (1 + c:)~”, where c, is a positive constant 
depending on « but not on n. 

Proof. The number of blocks Xn, of nr digits satisfy- 
ing (8.8) is simply 

dD plnr,k)+ Do plnr,k) = D7 p(nr,n + J). 
k>n-+ne k<n—ne lj] >ne 

We estimate this sum as follows. Taking 7 sufficiently 
large so that ne > 1, we note that the sum has fewer than 
mr non-zero terms. Also by the two preceding lemmas we 
observe that each term of the sum satisfies the inequality 
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p(nr,n +7) <r" exp {—j?/4nr} < r™ exp {—(ne)?/4nr} 
= 1" exp {—ne?/4r}. 
Thus the number of blocks X,, satisfying (8.8) is at: most 
(nr)r"” exp {—ne?/4r} = (nr)r™"(1 + e,)~” 

where c, is defined by the relation 1 + c, = exp {e*/4r}. 
Lemma 8.8. For all sufficiently large m the number of 

blocks Xm satisfying 


> me 


(8.9) 


m 
N(b, Xm) — — 
- 


ts less than mr™(1 + c)~™, where c is a positive constant not 
dependent on m. 

Proof. If m is a multiple of r, this lemma is implied by 
Lemma 8.7. Thus our problem is to extend Lemma 8.7 
for say m = mr +d withO<d<pr. To any block Xp, 
there correspond r¢ blocks Xm where the first nr digits of 
Xm coincide with X,,, and the last d digits of X, are 
arbitrary. 

We now prove that, given any block of digits X,, which 
satisfies inequality (8.9), then the first nmr digits of this 
block, Xnr, satisfy (8.8). This is almost intuitively clear 
since the right side of (8.8), namely ne, is replaced in 
(8.9) by the much larger value me. To spell out the de- 
tails, we assume that a certain X,, satisfies (8.9). Then 
we use the triangle inequality to write 


me< 


Nob, Xm) , 4 “ = |N(, Xm) ~ N(b,Xnr) | + 
T 


|[N(0, Xnr) — n| + 


™m 
n--— 
Tr 


Sd+|N(b, Xar) — | +1, 
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whence we have, for sufficiently large n, 

|N(b, Xnr) — n| > me —d—12nre—d—1>ne, 
and this is inequality (8.8). 

Hence we can prove the present lemma by using the 


estimate from the previous lemma, multiplying by r? to 
account for the extra digits in Xm. This gives the bound 


rU(nr)r™™ (1 + ey)" S mr™{ (1 + ey)! —™ 
< mr™{(1 + ¢,)'27}— = mr™(1 + 0). 


Thus far the notation X,, has designated the block of 
digits 21:29 +++ 2m. It will be convenient also to regard 
Xm as the terminating decimal .1,z2 --- Zm or the corre- 
sponding point in the unit interval. 


Lemma 8.9. Let € and e, be arbitrary positive numbers. 
Cover each Xj = .%\2q +++ 2; for which 


NO, X,) - ‘| > je 
Tr 


with a closed interval (Xj, X; + 17%) for all j 2 m and all 
values of b. For all sufficiently large m the total lengths of 
these intervals is less than ey. 

Proof. The length of the interval (X;, X; + r~’) is ob- 
viously r~7. We use the bound given in Lemma 8.8 for 
the number of X; involved, and also we multiply by r to 
account for all values of b. Hence the total lengths of the 
intervals is less than 


i} 


22 0 +o). 


j=m 
This series converges for any value of m, and so we can 


choose m sufficiently large so that the tail of the series so 
obtained is arbitrarily small, in fact less than ¢. 
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TurorEeM 8.10. The set S of infinite decimals x = 
.@122%3 -++ which are not simply normal to base r has meas- 


ure zero. 
Proof. Let T(e) denote the set of those x such that 


1 1 
~N(b, Xj) —-|>.« 
J r 


for some b and for infinitely many 7. This is the inequality 
used in Lemma 8.9, and z is in the interval (X;, X; + r7”). 
Hence, by Lemma 8.9, the set 7'(e) has measure zero. By 
(8.1) we note that T(e) is a subset of S. In fact we have 
for any positive « 


S = T(e) U T(e/2) U T(e/4) U---. 


Each set in the union on the right has measure zero, and 
so S has measure zero by Lemma 8.3. 


THEOREM 8.11. Almost all real numbers are normal to 
every base. 

Proof. By Lemma 8.1 this can be established by prov- 
ing that almost all real numbers x have the property 
that, for every r > 1, each of x, rz, r*z, --- is simply nor- 
mal to all of the bases 7, r?, 73, ---. This is equivalent to 
proving that the following set has measure zero: the set 
of real numbers x such that 7”z is not simply normal to 
base r” for some positive integers r= 2, m20,n2 1. 
The proof is by a succession of applications of Lemma 8.3. 

Let S(r) denote the set of those real numbers that are 
not simply normal to base r. That portion of the set 
S(r) which lies in the unit interval was designated by S 
in Theorem 8.10. Now by that theorem S has measure 
zero, and so the set S(r) has measure zero. This applies 
to any base, so the set S(r”) has measure zero for any r 
and n. Let V(r) denote the union of the sets S(r”) for 
n = 1, 2,3, ---, and thus V(r) has measure zero. If each 
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element of V(r) is multiplied by r~”, there results a set 
which we denote by r-” V(r). This set has measure zero 
because any collection of intervals covering V(r) can be 
transformed by similar use of the multiplicative factor 
r—™ to give a covering of r~” V(r). Now r~” V(r) con- 
sists of those real numbers z such that 7” x is not simply 
normal to base r” for at least one value of n. If W(r) de- 
notes the union of r-” V(r) for m = 0, 1, 2, -+-, and W 
the union of W(r) for r = 2, 3, 4, ---, then W also has 
measure zero, and the proof is complete. 


3. Equivalent definitions. We return to the various 
definitions of normal number given in §1, and prove 
their equivalence. 

Lemma 8.12. Let r, k, and s be given positive integers 
wihr2=2,1sssk. Let B, be a fixed block of k digits 
to base r and « any positive real number. Then for all t 
sufficiently large the inequality 

t 2et 
(8.10) N.(Bx, At) > oe 
is satisfied by all blocks A; of t digits apart from at most 
er’ exceptional blocks. 

Proof. First we reformulate Lemma 8.8 with the base 
r replaced by r*. Thus the number of blocks Xm of m 
digits to base r* satisfying 


> me 


m 
Nb, Xm) — = 
r 


is less than mr*™(1 + c)—”, where the positive constant c 
is independent of m. Here 6b is a single digit to base r*. 


To say this another way, the inequality 


me = Nb, Xm) — —; 2 —me 
Tr 
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is satisfied by all blocks X,, apart from at most 

mrk™(1 + ¢)7™ 
special blocks. A fortiori, we can say that the inequality 


(8.11) N(b, Xm) 2“ — me 
r 


holds for all X,, apart from at most mr*™(1 + c)~” blocks. 
We now interpret this in base 7, so that b becomes a block 
B, of k digits and X,, becomes say Xm, with mk digits. 
Also N(b, Xm) becomes N;(Bx, Xmx), the subscript 1 oc- 
curring now because we do not count the total number of 
occurrences of B;, in Xmx, but only those appropriately 
located. We can extend this to get 


(8.12) Ni(By, Xmk) = N.(Bi, 7.0)) 


by regarding A; as the block X,,, with s — 1 digits at- 
tached at the left end, and t ~ mk — (s — 1) digits at- 
tached at the right end. We can be certain that (8.12) 
holds if the number of digits attached at the right end is 
fewer than k. This we do by regarding the integers {, s, 
and k as given, and from them we determine m as the 
quotient when we apply the division algorithm to t — 
s+ 1 and k, 


t-~s+l=mk+u, Osu<k. 
This implies that 
t t 
(8.13) mk St < mk + 2k, poets 


Thus we have obtained (8.12) by identifying most of 
the digits of A, with the block Xinz, 


Ay = @Qq +++ Gy 


= 102 °° * Ag_1X1%2 + °° LmeA1—u419t-u42 °°" Ay 
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with fewer than k digits both prior to and following Xz 
because s — 1 <k and u <k. Corresponding to a fixed 
set of values for X,, there are r’—”” blocks A; because of 
the t — mk digits available at the two ends. 

Thus we can rewrite (8.11) in view of (8.12) and the 
other discussion, and conclude that for all sufficiently 
large ¢ the inequality 


m 
(8.14) N,(Bi, A.) =| — me 
: 


holds for all blocks A, apart from at most 
mr*™(1 +c) rt *™ = mr'(1 + ¢)~™ 


exceptional blocks. As ¢ increases indefinitely, so does m, 
and so m(1 + c)~™” tends to zero. Thus (8.14) holds for 
all A, apart from at most er’ exceptional blocks, provided 
tis sufficiently large. 

Also we note that et/k > 2/r* for ¢ sufficiently large, 
and so by (8.13) 


In view of this, we see that the inequality (8.10) is satis- 
fied by any block A; which satisfies (8.14), and so the 
lemma is proved. 


TuHeoreM 8.13. If the infinite decimal x = .4\xq--- 
satisfies (8.2), then x satisfies (8.4). 

Proof. To prove this converse of Lemma 8.1, we be- 
gin by observing that (8.2) implies that for any block of 
t digits A; and any positive e 
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(8.15) Nigh 


rt rt rt 


for all sufficiently large n. 

Recall that N,(B,, Xn) counts the number of occurrences 
of B, in X,, of a certain type, namely b) = xm, be = Im41, 
+++, bg = Im4e—1 With m =s (mod k). We establish 
that, for t > k, 


(8.16) n—t+1 
((-—k+ 1)N.(Bi, Xn) = DD Nel Bu, cptj41-++ ty4e-1), 
j=l 


where the notation N, in the sum on the right is to be in- 
terpreted as counting occurrences of B, subject to the 
same condition on the subscripts: namely, occurrences of 
the form b; = 2n, +++, bg = Im4e—1 With m = 8 (mod k), 
Any such occurrence of B, in X, is counted exactly t — 
k + 1 times by the expression on the left side of (8.16), 
and at most t ~ k+ 1 times by the expression on the 
right as 7 ranges over its assigned values. 

Having established (8.16), we now approximate the sum 
on the right side of this inequality. As j ranges over the 
values 1, 2, --+, m—¢-+ 1, the block wjrj4, +++ 242-4 
is identical with any specified block A, of t digits at least 
nr~'(1 ~ e) times, by (8.15). Hence we can change the 
sum in (8.16) to a sum over all the r* possible values A,, 
thus 


(¢-k+ IN. (Bi, Xn) > nr“(1 — @) 2, No(Bi A). 


This sum ranges over all possible blocks A;, and the inde- 
terminate subscript o can remain entirely unspecified be- 
cause we now apply Lemma 8.12. Thus we apply the 
inequality (8.10) to N,(Bx, As) in all cases of nonexcep- 
tional blocks A,, in number at least 7’ — er‘. Disregard- 
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ing the exceptional blocks, we obtain 
(é —k + 1)N,(B;, Xn) 


t 2et 
> nr" — &)(r' — er’) ( — =) 


for n and ¢ sufficiently large. (The steps in this chain of 
inequalities are incorrect in case e is large, for example 
e = 1, but the final result is obviously correct for such 
values of e.) Hence we have, for any positive e, 


N,(By, Xn t 1 1 
“Oe 5 —_ (= - ) > apne 
n t~k+1 \kr* kr 

NBs, Xn 1 
lim ing VBe Xe) 2 
nwo nT ker* 


This result holds for the r* possible blocks B, and the k 
possible values of s. Hence we can apply Lemma 8.2 
to obtain (8.4), and the proof of the theorem is com- 
plete. 


TsroreM 8.14. The definitions (8.4) and (8.5) are 
equivalent to (8.2). 

Proof. Theorem 8.13 established that (8.2) implies 
(8.4). It is obvious that (8.4) implies (8.5). The logic 
will be complete if we prove that (8.5) implies (8.2), which 
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we now do. By (8.5) we see that, for any block of digits 
A, and any positive e, 
noon 
(8.17) Mi(Ap, Xx) > 
tr tr 
for all n sufficiently large. 
Next we establish that, with k << t <n, 


(8.18) N(Br, Xn) 2 Do N(Bi, ADNi (As Xn) 


nen 
> (3 = “) > N(Biy Ar), 
tr tr'] A; 
the second inequality arising from (8.17). This is seen 
by partitioning X,, into consecutive blocks of ¢ digits and 
then counting N,(A;, X,). Then we count N(B,, A), 
the number of occurrences of B, in A;, perform the ob- 
vious multiplication, and sum over all possible A;. This 
sum may not yield exactly N(Bz, Xn), because we lose 
any B, that straddles blocks of length ¢ in the partition- 
ing of Xy. 
The expression N(B;, A;) on the right side of (8.18) can 
be written : 
N(Bi, A;) = pe N.(Bi, Ai), 
s=1 
and Lemma 8.12 is applied to these terms. Thus (8.18) 
implies 


N(Br, Xn) > ¢ =") (rt rk ( <) 
? n. aa aaa — € ae ee 
: inf ire krt Ok 


-1(-a)a-9 


> n(r—* — 2e) (1 — 2e) 
> n(r—* — 46), 
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for all n sufficiently large. (As in the proof of the previous 
theorem, the steps in this chain of inequalities are incor- 
rect if ¢ is large, but the final result holds nevertheless.) 
Thus we have 
.— . -N(Bi, Xn) 
lim inf —————— 


noo n 


IV 


1 
rh 
for any block B, of k digits. Application of Lemma 8.2 
leads to the conclusion (8.2), which proves the theorem. 

There is a connection between normal numbers and the 


concept of uniform distribution modulo 1, the definition 
of which was given in § 3 of Chapter 6. 


TuHEoreM 8.15. The number x is normal to base r if 


and only if the numbers x, rz, r?z, +--+ are uniformly dis- 
tributed modulo 1. 
Proof. Suppose first that z, rz, r?x, --- are uniformly 


distributed modulo 1. Let the decimal expansion of the 
fractional part of x be .2,%_%3 --- to base 7, and let By = 
bybe --+ b; be any block of k digits. We must prove that 
By, occurs in .242%9t3 +++ with frequency r~*. Let J de- 
note the open interval of points y satisfying 


by bg see be Sy <r * + bybe «++ dy, 


so that I has length r~*. The decimal expansion to base 
r of every such number y begins with the digits bib, --- 
b,: that is, By. 

Next, if the fractional part (r"x) of any number r™z 
has decimal expansion beginning with 6,b2 --+ by, then 
(r™x) belongs to the interval J. For the only other possi- 
bilities are that (rx) is one of the end points of the closure 
of the interval, such as (r“r) = .bibo --- by. But these 
two possibilities are clearly ruled out by the hypothesis. 
It follows from the definition of uniform distribution that 


1 n 
lim — N(By, Xn) = lim —=r*, 
n 


nove 
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where n(Z) denotes the number of those points (2), (rz), 
+++, (x"—1!z) which lie in the interval J. 

Conversely, suppose that z is normal to base r. For 
any positive integer m, divide the unit interval into 7” 
closed subintervals, 


(8.19) 
(0, ro); Caase ar), 2r™, ods wets (1 a in, 1). 


Note that (rx), being irrational for every integer j, is not 
the end point of any of these intervals. The normality 
of the number z implies that the points (x), (rz), (r?z), 
+++ are distributed with equal frequency in these inter- 
vals. That is to say, if R denotes any one of the subinter- 
vals (8.19), and if n(R) denotes the number of those 
points (x), (rz), --+, (r”~*z) which lie in R, then 
n(R) 

(8.20) lm —— =r 


nro nN 


—m 


Next, let J be any subinterval of the unit interval, say 
I = (a, 8) withO<Sa<851. We are not assuming 
anything about the closure properties of J, so that a and 
8 may or may not belong to J. Let R, be the collection 
of those intervals (8.19) that lie entirely inside J. Thus 
the length of R, is at least 6 — a — 2r-”. Hence, given 
any « > 0, we see by (8.20) that, for all n sufficiently 


large, 


If we choose m large enough so that r~” < «/4, then we 
can conclude that 
nl 
MD) ae 
n 


for all n sufficiently large. 
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Similarly, if we write R2 for the collection of those inter- 
vals (8.19) that have one or more points in common with 
I, we see that the length of Rz is less than 8 — a + 2r7—”. 
Also we conclude that 

nT n{R € 
mM) ge) cp tort isp—ate 
n n 2 
for all n sufficiently large. Since « can be made arbitrarily 
small, we conclude that 
n(I 
lim me) =B- a, 
noo 71 
and the theorem is proved. 

Theorem 8.15 can be used to prove such a proposition 
as: if x is normal to base 7, so is jz for any non-zero integer 
j. This result holds also for any rational 7 ¥ 0, but the 
proof of this stronger statement is more difficult. 


4. A normal number exhibited. It is not known 
whether such numbers as 1/2, e, and m are normal to any 
base. It turns out that the decimal expansion obtained 
by writing the natural numbers in order gives a normal 
number. For convenience we shall prove this for the 
special base 10, although the proof is readily extended to 
any base. 


THEOREM 8.16. The number 
(8.21) 2+ =0.1234567891011121314--- 


formed by writing the natural numbers in succession is nor- 
mal to base 10. 

Proof. By Xz, we shall mean, as usual, the block con- 
sisting of the first n digits of z in (8.21). We think of X, 
as partitioned into blocks corresponding to the natural 
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numbers; thus 
(8.22) X, = 1, 2,3, 4, 5, 6, 7, 8, 9, 10, 

11, 12, 13, 14, -+-, ayaa +++ Gm, °° 


The last complete natural number in this partitioning of 


Xn is assumed to have digits ajaga3 --+ @m, and, if we de- 
note this number by u, we have 
wu = a,10"~) + agl0"~? +---+ dn, a, ~ 0. 


In (8.22) there are at most m digits after the last comma, 
and, since there are at most wu + 1 partitions, we see that 


(8.23) ns m(u+ 1). 

It is convenient to define the numbers 

uj = [(ul0~] = a,10"—1~ + agl0™ 279 +--+ On_ys, 

j=1,2,---,;m—1. 

We are concerned with estimating the number of occur- 
rences of B, in Xn, where By = bbe +--+ by. In making 
this estimate we count only the occurrences of B;, inside 
the partitions which make up (8.22), ignoring occurrences 
that straddle the commas therein. We look for blocks of 
digits in (8.22) having the form 
(8.24) yrye +++ Yedibde «++ deey2q +++ 2p = VeBuZy. 


There being at most m digits in any partition in (8.22), 
we require that s-+k+t< m. Moreover the natural 
number corresponding to the block of digits (8.24) must 
not exceed u, and this will be guaranteed if we require that 


y110°+ + y210°—? fet y, < a,10"—#—-#-1 
+ a210"—*-!-? + oie “+ Om kt = Uk4e- 


In view of this, we have uz4,; — 1 possible values for the 
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block Y, in (8.24), and 10° possible values for the ¢ digits 
Z, Thus the number of blocks of digits of the form 
(8.24) to be found in (8.22) is at least 


m—k—1 


> 10°(un4e = 1), 
=0 


where we get this maximum value m — k — 1 for ¢ by 
setting s=1 in the inequality s+k+tXm. Now 
N(By, Xn) is the total number of occurrences of By, in 
(8.22), and so we have 


m—k—1 


N(Bz, Xn) = DS 10g — 21). 


t=0 
By the definition of w; we have u; > ul0~? — 1, and so 


m—k—1 


N(Bi, Xn) > DS 10'(u-107*-* — 2) 
t=0 
m—k—1 m—k—1 
= > wl0*—- DD 2-10' 
t=0 =0 


> (m — k)u-10~* — 10", 


Dividing the left side by n and the right side by m(u + 1), 
we see by (8.23) that 


1 
— N(Bx, Xn) 
n 
m-k u 1o™-* 4 
—_—— 107* — — 
m u+l utlm 
1 k 107-* 1 
= 10 — 10-* | ice ea oe ae 
“eti uti m util m 
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Note that 


<i. 


Also we see that, as n tends to infinity, 


u—> , m— ©, —— 0. 
m 


Hence, for any given e > 0, we can choose n sufficiently 
large so that 


1 
~ N(By, Xn) > 107 ~ ¢, 
n 
1 
lim inf — N(B;, Xn) = 10~*. 


nao 2 


This holds for all blocks By, of k digits, and so we apply 
Lemma 8.2 to get 


: N(Bi, Xn) 
lim ———— 


n> © n 


= 107-4, 


Thus z is normal to base 10. 

This proof could be shortened slightly by the use of 
Lemma 8.12, but we have preferred to give a direct argu- 
ment independent of that lemma. 


Notes on Chapter 8 


Normal numbers were introduced, with definition as in equations 
(8.4), by E. Borel, who proved the central Theorem 8.11 that al- 
most all real numbers are normal; cf. Rend. Circ. Mat. Palermo, 27 
(1909), 247-271. The special case of Theorem 8.11 that almost all 
real numbers are simply normal can be proved as an application of 
the strong law of large numbers in the theory of probability; cf. 
Paul R. Halmos, Measure Theory, Van Nostrand, 1950, p. 206. 
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The equivalence of definitions (8.4) and (8.5), proved in Theorem 
8.14, is due to S. 8. Pillai, Proc. Indian Acad. Sct., A, 12 (1940), 
179-184. Theorem 8.13 is due to Niven and Zuckerman, Pacific 
J. Math., 1 (1951), 103-109; the proof given here is due to J. W. S. 
Cassels, Pacific J. Math., 2 (1952), 555-557. The proof of Theorem 
8.14 was suggested by this paper of Cassels and work of J. E. Max- 
field, Pacific J. Math., 2 (1952), 23-24. 

J. E. Maxfield has treated normal numbers in the n-dimensional 
case; cf. Pacific J. Math., 3 (1953), 189-196. Maxfield also points 
out (p. 195) that the set of numbers simply normal to no base what- 
ever is uncountable. It is easy to establish the somewhat weaker 
proposition that the set of numbers not simply normal to a specific 
base r is uncountable: consider the uncountable subset thereof of 
numbers of the form .0z20z,0z5 --+, where each 22, is either 0 or 1. 

Theorem 8.15 is due to D. D. Wall, Normal Numbers, Ph.D. 
thesis (1949), University of California, Berkeley, California. 

Theorem 8.16 is due to D. G. Champernowne, J. London Math. 
Soc., 8 (1933), 254-260. This result has been generalized by A. H. 
Copeland and P. Erdés, Bull. Amer. Math. Soc., 52 (1946), 857- 
860. 

One topic that we have not touched upon is the « normality of 
integers of A. 8. Besicovitch, Math. Zett., 39 (1934), 146-156. See 
also H. A. Hanson, Can. J. Math., 6 (1954), 477-485. 

E. Borel defined an absolutely normal number as one that is normal 
to every base. However, might it be that if a number is normal to 
one base, then it is absolutely normal? The answer is no, as demon- 
strated independently by Cassels and Schmidt. For this and even 
stronger results, see J. W. S. Cassels, Collog. Math. 7 (1959), 95-101, 
and W. Schmidt, Pacific J. Math. 10 (1960), 661-672. 

The definition (8.2) of a normal number contains superfluous 
conditions: it is enough to say that the requirements are satisfied 
for infinitely many integers &. In fact it is not difficult to prove 
that, if the conditions are satisfied for some fixed k, they must, also 
hold for k — 1. However, it is not enough to require that the equa- 
tions hold for some finite set of values of k: cf. I. J. Good, J. Lon- 
don Math. Soc., 21 (1946), 167-169; also D. Rees in the same jour- 
nal, 21 (1946), 169-172. 


CHAPTER Q 


THE GENERALIZED LINDEMANN 
THEOREM 


1. Statement of the theorem. Transcendental 
numbers were first exhibited by Liouville, using a tech- 
nique which we set forth in Theorem 7.9. Later, Her- 
mite in 1873 proved that e is transcendental (Theorem 
2.12), and Lindemann extended the method to z in 1882. 
The transcendence of e and 7 are special cases of a more 
general theorem of Lindemann which is the subject of 
this chapter. We state two equivalent forms. 


THEOREM 9.1. Given any distinct algebraic numbers ay, 
Q2, ***, Am, the values e™', e%, ---, e™ are linearly inde- 
pendent over the field of algebraic numbers. 

Alternative statement. Given any distinct algebraic num- 


bers a, a2, ***, Om, the equation 
™m 
(9.1) > ayes = 0 
j=l 
ts empossible in algebraic numbers a1, Az, ++, Am notall zero. 


The proof of this result is given in the next two sections. 
It is clear that this theorem implies that e is transcenden- 
tal, indeed that e* is transcendental for any non-zero al- 
gebraic number a. This observation includes the trans- 
cendence of 7 as a special case. For, if x were algebraic, 
so would tr be algebraic by Theorem 7.2, and hence e’™ 
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would be transcendental, contrary to the well-known fact 
that e** = —1. Theorem 9.1 has other consequences, but 
we postpone further discussion of these until § 4 in order 
to get on with the proof of the theorem itself. 


2. Preliminaries. We begin by outlining some needed 
algebraic background material, in particular some deeper 
properties of algebraic numbers than we have discussed 
heretofore. No proofs will be given of Theorems 9.4, 9.5, 
or of the fundamental result on the algebraic dependence 
of any symmetric polynomial on the elementary sym- 
metric polynomials. These are standard results; so we 
merely cite references. 

The elementary symmetric functions ¢1, g2, °++, on of 
21, Z2, ***, Xm can be defined by the identity 


(y — 21)(y — %2) +++ (y — tn) 
= y” oe oy”? + voy”? ee +f (—1)"on. 


In general a symmetric function of x1, Z2, +--+, Zn is one 
that is invariant under any permutation of 21, -+-, Zn. 
Any symmetric polynomial in 2), :--, 2, with coefficients 
in a field F is expressible as a polynomial in the elementary 
symmetric functions o1, -:-, on With coefficients in F. 
Moreover, if the symmetric polynomial in 21, -+-, Z, has 
rational integers for coefficients, so does the polynomial in 
ot, ***, on. Proofs of these results can be found readily 
in books on the theory of equations; for example, J. V. 
Uspensky, Theory of Equations, McGraw-Hill, p. 63, p. 
264 (1948). These results imply the following proposition. 


TuHeoreM 9.2. Let Bi, B2, --+, Bn be the roots of a poly- 
nomial equation 


f(z) = bz” + cya"! + cg"? +++ -+ ce = 0, 


with rational integral coefficients. Let P(x, 22, ---+, Xn) be 
a symmetric polynomial in x1, +++, In with rational coeffi- 
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cients. Then P(B1, Bo, +++, Bn) is a rational number. 
Moreover, if the polynomial P has integral coefficients and 
is of degree t, then b'P(B,, Bo, ---, Bn) is an integer. 

Proof. The first part follows from the close relation 
between the elementary symmetric functions of 6, ---, 
6, and the coefficients of f(x): o, = —c,/b, etc. Now bf, 
bBo, -+-, Bn are the roots of 


b"—"f(a/b) = 2” + ez"! + begt™? +---+ b™ 1c, = 0, 


and so the elementary symmetric functions of b@;, be, 
+++, 68, are integers. If p(x, 22, ---, 2n) is a homogene- 
ous symmetric polynomial of degree r S ¢ with rational 
integral coefficients, then 


b’p(Bi, Ba, ee | Bn) = p(bBi, bBe, eae} bBn), 


and so b'p(1, Be, +, Bn) is an integer. By separating 
the polynomial P of the theorem into a sum of homogene- 
ous polynomials p, we readily extend this argument from 
p to P. 

We shall also make use of the following result on sym- 
metric functions. 


Lemma 9.3. Consider the q polynomials P,, Po, ---, Pg 
in Y1s Ya, + °°) Um) 
P5 = files)ys + falrs)y2 ++ +++ fn(xs)y¥m; 
j= 1, 2, “75d 


with coefficients f;(x;), where all the f;(x) are polynomials 
over any field F. The product of these polynomials, the 
terms in y being collected, has coefficients which are sym- 
metric polynomials in x1, 2, +++, Xq- 

Proof. Let us write the product 


(9.2) PyPa ++ Pa = Qo Yin Yin “++ Yig 
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The condition 7; S 12 S---S 7, is imposed on the sum to 
indicate that the terms have been collected. We must 
prove that each coefficient c = c(z,, ---+, Z,) is symmetric 
in the 2, ---, 2. Any permutation of 2, ---, Zq ap- 
plied to (9.2) leaves the left side invariant because it 
merely permutes the polynomials P;, Pe, ---, Py. Hence 
such a permutation leaves the right side invariant, and so 
must leave each coefficient c invariant. 

We shall assume that the reader is familiar with the 
concept of extension of an algebraic number field, as 
given, for example, in the Carus Monograph by Harry 
Pollard, The Theory of Algebraic Numbers, Chapter IV. 
In particular, we state without proof the following results, 
which are Theorems 4.6 and 4.7 of Pollard. 


TueoreM 9.4. If F is an algebraic number field, and if 
6 ts algebraic over F, then any element 8 of the extension field 
F(6) can be expressed as a unique polynomial in @ with co- 
efficients in F, 


B = a + a6 + a6? +--+ @n_167”, 


where n ts the degree of 8 over F. 

We shall need a generalization of this to a multiple ex- 
tension F'(6;, 62, ---, 9), that any element of this exten- 
sion is expressible as a polynomial in 6), 62, -++, 6; with 
coefficients in F. This generalization is immediately ob- 
tained from Theorem 9.4 by use of mathematical induc- 
tion on s. 


THEOREM 9.5. A multiple algebraic extension of an al- 
gebraic number field F is a simple algebraic extension. In 
other words, if the numbers ay, a2, +++, as are algebraic over F, 
then there is an algebraic number y such that the extension of 
F by y ts the same as the extension of F by a, a2, +++, a; thus 


F(y) = F(a, a2, «++, a). 
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We shall write R for the field of rational numbers. 
The algebraic field R(@) is said to be normal over R, pro- 
vided that any polynomial irreducible over R that has 
one root in 2(6) has all roots in R(6). 


THEOREM 9.6. Given any algebraic numbers a1, a2, -*:, 
ag, there exists an algebraic number 6 such that the extension 
field R(@) contains the field R(a,, ---, ae), and R(8) is 
normal. 

Proof. First we apply Theorem 9.5 to obtain y such 
that R(y) = R(ai, ---, as). Now let the minimal equa- 
tion of y be A(x) = 0 with coefficients in R, and let the 
roots of this be y = v1, Y2, ***, Ym. Again we apply 
Theorem 9.5 to find an algebraic number 6 such that 


R() = Rtn, Y2, °"") Ym) >) R(y) = R(x, G2, °°", as). 


To prove that R(6) is normal, let g(z) = 0 be an irreducible 
equation over & having a root pin R(6). Thus g(z) is the 
minimal polynomial of p. Now, by the generalization of 
Theorem 9.4, p is a polynomial in 71, v2, -*-, Ym With ra- 
tional coefficients, say p = f(v1, Y2, °**; Ym). We form 
the polynomial 


G(zx) = Il {x ern Yip td *, Vin) } 


of degree m! in z, where the product is taken over all per- 
mutations 71, 72, °+*, Im of 1, 2, ---, m. The coefficients 
of G(x) are symmetric polynomials in its roots: that is, in 
S(Yiss Yix ‘**; Yin)» Any permutation of yi, -+-, Ym 
merely permutes the f(7:,, Yi. ‘°°, Yi,) among them- 
selves. Hence the coefficients of G(x) are symmetric poly- 
nomials in ¥1, ye, -*°, Ym, and so are rational numbers 
by Lemma 9.2. The polynomials g(x) and G(x) have the 
root p in common, and hence the minimal polynomial 
g(x) is a factor of G(x). But all roots of G(z) = 0 are 
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elements of the field #(6), and so all roots of g(z) = 0 are 
in R(6), and the proof is complete. 

Let the degree of the normal field R(6) be n, so that 6 
satisfies a minimal equation with rational coefficients, say 


(9.8) f(z) = 2” + bx”! + bot”? + --- +d, = 0, 


irreducible over R. Any element of R(@) is expressible as 
a unique polynomial in @ of degree at most n — 1. Write 
6 = 9), gf) 9... @™ for the roots of (9.3). Since 
R(6) is normal over R, all these roots belong to R(@); so 
we can write these conjugates in the form 


(9.4) @9) ax h;(6), j= 1, 2,- “yn, 


where the h;(#) are polynomials in 6 with coefficients in 
R. The elementary symmetric functions of h,(6), he(8), 
+++, h,(0) are also polynomials in 6 with coefficients in R, 
and so these reduce to the rational numbers —)},, be, 
—b3, «++, (—1)"b, of (9.3) when the relation 


6” = —b,6"! — boo"? —---— db, 


is used to eliminate @” and higher powers. 

Now 6) satisfies the same relation, and therefore the 
elementary symmetric functions of h,(6%), he(0), ---, 
h,(@™) are also equal to —b,, be, —bg, «++, (—1)"bn. 
Hence we conclude that the numbers 


hy (6), ho (8), » ++, An(O) 


are the roots of (9.3), and so are the same as 0°, 
6@), ..., @™ in some order. Another way of stating this 
is that, if 6, g@, ---, 6 are regarded as polynomials 
in 6, and if in these polynomials @ is replaced by 0, 
there merely results a permutation of the conjugates 6”, 
-++», @™., There is nothing special about 6 in this 
analysis; so we can generalize to any 0, 


2 
66 , 
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Lemma 9.7. Let R(8) be a normal algebraic extension of 
degree n over R, the conjugates of 6 being 9 = 6°), @, .-., 
g™, These conjugates, regarded as polynomials in 6, are 
merely permuted by the substitution of @ for 6. More gen- 
erally, if F(x) is any rational polynomial, then the set 


(9.5) Fo), Fg); ae) Fe) 


is permuted by the substitution of any 0 for 8. 

Proof. The set (9.5) can be regarded as a set of poly- 
nomials in 6 by virtue of (9.4). Replacement of @ by 0 
permutes the conjugates 6", ---, 0 by the argument 
preceding the lemma, and so permutes the values (9.5) in 
the same way. 

Any element y of 2(@) is a polynomial in 6 with rational 
coefficients, say y = F(6), and the values (9.5) are called 
the conjugates of y over the field R(6). These conjugates 
are also written as y = y°?, y®, ---, y“. The elemen- 
tary symmetric polynomials of y, ---, y are sym- 
metric polynomials in 6“, ---, a, and so are rational 
numbers. This proves the following result. 


Lemma 9.8. Any element y of R(@) and its conjugates 
over R(6) satisfy a polynomial equation g(x) = 0 of degree 
n with integral coefficients. 

We conclude this section with the following rather spe- 
cial result. 


Lemma 9.9. Consider the functions 
m t 
f(a) = Do aye, g(x) = Do dja, 
j=l j=1 


with non-zero complex coefficients a; and b;, and exponents 
a; and 8; that are algebraic numbers. Assume that the a; 
are distinct, and likewise that the 8; are distinct. If the 
product f(x) g(x) 1s formed, and all terms with equal expo- 
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nents are combined, then there is at least one non-zero co- 
efficient in the result. 

Proof. Applying Theorem 9.6 to the numbers ay, ---, 
Om, Bi, °**, Bt we get a normal field R(6) containing all 
these numbers. As before, let n denote the degree of the 
field R(6). By Theorem 9.4 we can write each a; as a 
unique polynomial in @ with rational coefficients, 

n—-1l 
aj > Pr. i". 
i=0 
We order these a; as follows: we say that a; precedes a, 
in case the first non-zero term of 


T30 — Tko, 771 —~ Tk1, 72 — Tre, °*° 


is positive. In terms of this ordering principle, let us re- 
arrange notation so that a is first among the a;, and fp; 
first among the @;. Then it is clear that a; + @ is first 
among all sums a; + 6;. Thus, in the product f(x) g(x), 
the term a,b,x%'t*! has a unique exponent, and so cannot 
be combined with or canceled by any other terms. 


3. Proof of the theorem. In proving the generalized 
Lindemann Theorem 9.1, we find it convenient first to 
establish the following special case. 


THEOREM 9.10. Given any m distinct algebraic numbers 
Oy, Gg, °**, Om, the values e*', e%?, ---, e*" are linearly inde- 
pendent over the field of R of rational numbers. 

Proof. Let us assume, contrary to the theorem, that 
there is a relation 


(9.6) > a; exp (a@;) =0 


j=1 


with rational coefficients not all zero. Discarding terms 
with zero coefficients and rearranging notation, we may 
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presume that no coefficient is zero. Furthermore, by 
multiplying (9.6) by a suitable integer, we may presume 
that the coefficients are non-zero rational integers. We 
use Theorem 9.6 to get a normal field R(@) which contains 
a1, 2, °**, &m. Let n be the degree of R(@) over R, so 
that each a; is expressible as a unique polynomial in @ of 
degree n — 1 with rational coefficients, say 


n—l 
a; = Do ry6', j= Ty 25 3+ Am, 
i=0 


As in the previous section, let 6 = 6, 0, ---, 0™ be 
the conjugates of 6, and so the conjugates of the a; over 


R(@) are 


n—l 
af? =F rslo¥ 
i=0 
J 12, 23%, k=1,2,--+,n. 


Each 6) is, like 6 itself, an algebraic number of degree n; 
so these polynomial expressions are unique in 6“, Con- 
sequently the hypothesis that the a; are distinct implies 
that for fixed k the aj” are distinct. 
Next we form the product 
n m i 
(9.7) O= II Do aexp (a?) = 2 cj exp (8), 
k=l j=l j=0 
and this product vanishes because of (9.6) and the fact 
that a{” is merely alternative notation for a;. The right 
member of (9.7) is the result of multiplying out the prod- 
uct and collecting terms with identical exponents, if any. 
Thus the @; are distinct. Since the a; are rational inte- 
gers, so are the c;. Furthermore, since the a; are non- 
vanishing, by Lemma 9.9 extended to a k-fold product we 
can conclude that at least one coefficient c; does not van- 
ish, say co # 0. 
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For a fixed value of j, the n conjugates a; are permuted 
by substitution of 6 for @ according to Lemma 9.7. 
Also by the proof of that lemma the permutation is the 
same whatever the valué.of 7. Hence the replacement of 
@ by 6 merely permutes the factors in the product in 
(9.7), so that the total product is invariant. As to the 
right member of (9.7), the replacement of @ by 6 re- 


places each 8; by the conjugate 6!. Hence (9.7) implies 


(9.8) 0 = Di csexp (B)”) = 2 cj exp (8) =--- 
7=0 j=0 


= > ¢; exp (8”). 
j=0 


We noted earlier that the 6} are distinct. Hence the 
8% are distinct for fixed 7. 

We multiply the first sum in (9.8) by exp {—8§”}, the 
second sum by exp {—£§”}, ---, the last sum by exp 
{~g§”}. Define 


(9.9) 3° = 8) — Bo’, 


Since the g are distinct for fixed 7, no y{” equals zero. 
Then equations (9.8) can be written 


(9.10) 
0 = eo + dics exp (yj) = co + Li cj exp (y;”) = °°: 


j=l j=l 


= co + Do csexp (7). 
j==1 


Now by Lemma 9.8 the conjugates y)?, Wyre, yf” are 


roots of a polynomial with integral coefficients, say 
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2-4) 
1 


(9.11) gj(z) = be” +--+ = by 


n 
= 


j=1,2, +--+, 7. 


We may take the integers b; > 0. Also, since no y{” = 0, 
it follows that the constant terms in these polynomials, 
g;(0), are non-zero integers. 

This completes the algebraic portion of the proof of 
Theorem 9.10, and we turn now to its analytic aspect. 
For any polynomial f(z), define F(z) to be the sum of f(z) 
and its derivatives, 


Fa) =f@+/@+fP@t--, 
so that we have 
{ F(z) exp (—2z)}’ 


—S@) exp (—2), 


F(b) — FO) exp (6) 


b 
— exp wf f(z) exp (—z) dz. 


Substitute for b in this relation the values yf? as in (9.9), 
multiply each equation so obtained by c;, and sum over 


j=1,-+-:,randi =1,---,n to get 
rT n n Tr 
DX LX FO?) — FO) 2 Xe; exp {y}} 
j=l t=1 t=1 j=1 
r n x) 
7 4 
= = © Vevewp ty) [/ fe) exp (-a) ae 
j=l tel 0 


Applying (9.10) to the second term of this, we obtain 


(9.12) Die; p> F )| + neoF (0) 


j=1 
Dace) 


=- > Dd ej exp {7} |? f@) exp (—2) dz. 
0 


jel] isl 
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We now define the polynomial f(z) as 
Li Pp 
(9.13) f(z) = (bibe ++ b,)?7"2? (I 9;(2)¢ /(p — 1)}, 
j=l 


where p is a prime to be specified: We shall choose the 
prime p sufficiently large so that the left side of (9.12) is 
a non-zero integer whereas the right side is, in absolute 
value, arbitrarily small. Thus we will have the contra- 
diction which establishes the theorem. 

Because of the factor z?~! in f(z), it is clear that 


0 = f(0) = f’'(0) =f (0) =--- =f? (0), 
fP—Y (0) = (bibe --- by)?" TT {9,(0)}?. 
j=1 


We choose p > b; and p > g;(0) for j = 1, 2, ---, r, and 
hence p is not a divisor of the non-zero integer f‘?~! (0). 

On the other hand, for t = p we argue that f(0) is an 
integer divisible by p. To see this we think of f(z) as a 
sum of powers of z, so that the coefficient of each term in 
f(z) has ¢ consecutive integers as factors entering in 
from the differentiation process. But for ¢ 2 p the prod- 
uct of ¢ consecutive integers is divisible by p!, so that the 
(p — 1)! involved in f(z) is canceled. Moreover we can 
write 


(9.14) f(z) = plbybe «++ be)?" Gi(z), 


where G,(z) is a polynomial with integral coefficients of 
degree at most prn — 1. Hence f‘(0) is an integer di- 
visible by p. Using this and the result of the last para- 
graph, we conclude that F(0) is an integer prime to p. 
We require p to satisfy the inequalities p > n and p > ¢o, 
so that p is not a divisor of the integer ncgf(0) in (9.12). 

We now establish that the other term on the left side 
of (9.12), namely 
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x Cj [> Fos?)| 


t=1 


is an integer divisible by p. In fact we establish that the 
inner sum 


> F(yf?) = Ese”) + Lop) + IF (of) +: 


t=0l t=1 t=1 


is a multiple of p. Since f(z) has the factor {g;(z)}?, we 
see that (9.11) implies 


f(y?) = 0, f' (vf) a 0, a: fP-Y yf) = 0. 
As for the higher derivatives, we turn to (9.14), whence 


(9.15) 


n n 

DLO?) = VL Orbe +++ by)” Gely,”), bE. 
i=l i=l 
Now G,(z) is of degree at most prn — 1, so that in view of 
the factor b;?”" we can apply Theorem 9.2 to conclude 
that (9.15) is indeed an integer divisible by p. 

We have now established that the two expressions com- 
prising the left side of (9.12) are integers, the first a mul- 
tiple of p, the second prime to p. Hence the left side of 
(9.12) is a non-zero integer; so, if we take absolute values 
in this equation, we can write 


(9.16) 1s > > c; exp { mira f™ "1@) exp (—z) dz|- 


j=l i=1 


Define the following maxima for all 7 and 7: 


m, = max |cy| ; mz = max|exp (y;")|; 


m3 = max | {| ; 
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ms, = max|exp (—z)| on the straight line path from 
z=0 to z= 7"; 
r 
ms = max| J g;(z)| on the same path. 
j=1 


Note that m?—' is the maximum of |z?~| on the same 
path. Then the inequality (9.16) implies, with (9.13), 


1 < rnmymegm3m4(b1 be See b,)?""m3—m2/(p al 1)! 
= rnmymgm4(by"by” + ~~ b;"m3ms5)?/(p — 1)! 
Now r, n, 1, M2, M3, M4, Ms, by, be, «++, b, are independent 


of p, so that the last expression tends to zero as p tends 
to infinity. Thus we have a contradiction, and the proof 
of Theorem 9.10 is complete. 

It is now an easy matter to establish that the basic 
Theorem 9.1 is a consequence of Theorem 9.10. Suppose 
that there is some relation (9.1), where we may assume 
without loss of generality that none of the algebraic num- 
bers a; is zero. We use Theorem 9.6 to determine an al- 
gebraic field R(@), normal over R, which contains the al- 
gebraic numbers a;. The conjugates aS) belong to R(6), 
a field which we presume to have degree g over R. (Here- 
tofore we have used the symbol 7 to denote the degree of 
the field R(#@). Our switch to g is to emphasize that now 
R(6) is an extension field determined by the coefficients 
a;, whereas formerly it was determined by the exponents 
a;.) We form the product 


g 

IT {a1” exp (a1) + a2) exp (a2) +-+-+ an exp (am)} = 0. 
t=] be 

If we think of the coefficients a‘ as polynomials in 9, 


we see by Lemma 9.3 that this product has coefficients 
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which are symmetric in 9, 9, ---, 0, and so are 
rational by Theorem 9.2. By Lemma 9.9 the product 
does not vanish identically. Thus we have established 
that a relation (9.1) implies a similar relation with ra- 
tional coefficients, and this was proved impossible by 
Theorem 9.10. 


4. Applications of the theorem. In § 1 we pointed 
out that Theorem 9.1 implies the transcendence of e and 
aw. More generally we can state the following. 


THEOREM 9.11. The following numbers are transcen- 

dental: 

(a) e, 7; 

(b) e%, sin a, cos a, tan a, sinh a, cosh a, tanh a for any 
non-zero algebraic number a; 

(c) log a, arcsin a, and in general the inverse functions of 
those in the list (b), for any non-zero algebraic number 
a # 1; wherever multiple values are involved, every such 
value is transcendental. 

Proof. The first three listed, e¢, + and e*, were treated 
in §1. Next, if sin a were algebraic, say sin a = a, we 
would have 

e'* — e* — Qiae® = 0, 


with algebraic coefficients and exponents, contrary to 
Theorem 9.1. A similar argument applies to the other 
trigonometric and the hyperbolic functions, each being 
written in terms of exponential functions. 

As for those in group (c) in the theorem, first suppose 
that log a = a with algebraic a. Then we would have 
e* = a, and by the earlier argument this is possible only 
in case a@ = 0,a = 1. The other values in group (c) can 
be treated similarly, the restriction a # 1 being needed 
in some cases. 
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5. Squaring the circle. One of the celebrated prob- 
lems of antiquity is to construct, by methods of straight- 
edge and compass, a square equal in area to a given circle. 
The impossibility of this construction was settled by Lin- 
demann when he proved that 7 is transcendental. For, 
on the one hand, all line segments that can be constructed 
from a given unit length by a finite number of straight- 
edge and compass constructions have lengths that are al- 
gebraic numbers. (We do not prove this assertion here, 
since there are many excellent accounts of such geometric 
constructions, references being given in the notes at the 
end of this chapter.) On the other hand, given any cir- 
cle, we may regard its radius as the unit of length, so 
that the circle has area 7 square units. So the problem of 
constructing a square of equal area is equivalent to the 
problem of constructing a line of length +/z from a given 
unit length. But this is impossible by the admissible 
procedures, because +/z is transcendental, since other- 
wise it would follow from Theorem 7.2 that 7 = Wz: 
vx is algebraic. 


Notes on Chapter 9 


The work of C. Hermite on the transcendence of e is in Compt. 
Rend. Acad. Sci. Paris, 77 (1873), 18-24; that of F. Lindemann on 
the transcendence of x in Math. Annalen, 20 (1882), 213-225. This 
paper of Lindemann is also the source of Theorem 9.1; a more de- 
tailed proof was given later by K. Weierstrass, Math. Werke, I 
(1895), 341-362. The result is sometimes called the Lindemann— 
Weierstrass theorem. 

In the proof of Theorem 9.1 we have used ideas from two recent 
papers on the subject: R. Steinberg and R. M. Redheffer, Pacific 
J. Math., 2 (1952), 231-242; Th. Skolem, Norske Vid. Selsk. For- 
handl., Trondhjem, 19, no. 12 (1947), 40-43. 

Proofs of the transcendence of e and = are not so difficult as the 
proof of the more genera] Theorem 9,1. For example, e was treated 
in Theorem 2.12 in a fairly simple way. Both e and 7 are discussed 
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in Hardy and Wright, The Theory of Numbers, Chapter XI, by 
methods avoiding complex integration; x is discussed by Niven, 
Amer. Math. Monthly, 46 (1939), pp. 469-471. 

We cite a few of the papers giving applications and extensions of 
the Lindemann theorem: Verne E. Dietrich and Arthur Rosenthal, 
Bull. Amer. Math. Soc., 55 (1949), 954-956; William J. LeVeque, 
Proc. Amer. Math. Soc., 2 (1951), 401-403; R. M. Redheffer, Amer. 
Math. Monthly, 60 (1953), 25-27. 

§ 5. Geometric constructions by means of straightedge and com- 
pass are discussed in detail in many books, for example the follow- 
ing: Richard Courant and Herbert Robbins, What Is Mathematics?, 
Chapter IIT, Oxford (1941); H. P. Hudson, Ruler and Compasses, 
Longmans, Green (1916); E. W. Hobson, Squaring the Circle, Cam- 
bridge (1913); J. W. A. Young, Monographs on Modern Mathematics, 
No. VIII by L. E. Dickson, Longmans, Green (1911). 


CHAPTER 10 


THE GELFOND-SCHNEIDER THEOREM 


1. Hilbert’s seventh problem. In 1900 David Hil- 
bert announced a list of twenty-three outstanding un- 
solved problems. The seventh problem was settled by the 
publication of the following result in 1934 by A. O. Gel- 
fond, which was followed by an independent proof by 
Th. Schneider in 1935. 


TurorEemM 10.1. Jf a and B are algebraic numbers with 
a £0, a £1, and if B is not a real rational number, then 
any value of a® is transcendental. 

Remarks. The hypothesis that “8 is not a real ra- 
tional number” is usually stated in the form “@ is irra- 
tional.”” Our wording is an attempt to avoid the sugges- 
tion that 8 must be a real number. Such a number as 
B = 2+ 31, sometimes called a “complex rational num- 
ber,” satisfies the hypotheses of the theorem. Thus the 
theorem establishes the transcendence of such numbers as 
2* and 2V2. In general, a® = exp {@loga} is multiple- 
valued, and this is the reason for the phrase ‘“‘any value 
of” in the statement of Theorem 10.1. One value of 
1+ = exp {—27 log 7} is e", and so this is transcendental 
according to the theorem. 

Before proceeding to the proof of Theorem 10.1, we 
state an alternative form of the result. 

134 
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THEOREM 10.2. If a and y are non-zero algebraic num- 
bers, and tf a # 1, then (log y)/(log a) is either rational or 
transcendental. 

To prove that this is an equivalent statement to Theo- 
rem 10.1, write 8 for (log y)/(log «), so that y = a°. To 
prove that Theorem 10.1 implies Theorem 10.2, let us as- 
sume that the hypotheses of the latter are satisfied, but, 
contrary to the conclusion, that 6 is algebraic but not 
rational. Then y is transcendental by Theorem 10.1, and 
we have a contradiction. 

Conversely, if we assume the hypotheses of Theorem 
10.1 but deny the conclusion thereof, we have an algebraic 
number @ which is not zero. From application of Theo- 
rem 10.2 we conclude that 8 is either rational or trans- 
cendental, and again we have a contradiction. Thus Theo- 
rem 10.2 implies Theorem 10.1. 

Theorem 10.2 implies that the common logarithms of 
positive rational numbers, i.e., logarithms of positive ra- 
tional numbers to base 10, are either rational or trans- 
cendental. This can be seen from the simple relation 


log r 
log 10 


Thus we can state a stronger result than Theorem 2.11: 
if r is any positive rational number not of the form 10” 
where 7 is a rational integer, then log; r is transcendental. 


logio T = 


2. Background material. In this section we list the 
results from algebra and analysis which will be needed in 
the proof of Theorem 10.1. We state some of the results 
as lemmas for convenience, giving references but no 
proofs. Although this prerequisite material is more ex- 
tensive than any used heretofore in this book, it is very 
central to the main stream of mathematics. We do need 
a couple of special results for the proof of the Gelfond- 
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Schneider theorem, and they will be given with proofs in 
the next section. 


LemMaA 10.3. Consider a determinant with the non-zero 
element p? in the j-th row and 1 + a-th column, with j = 1, 2, 
--+,tanda=0,1,---,t—1. This ts called a Vander- 
monde determinant, and it vanishes if and only tf p; = px 
for some distinct pair of subscripts j, k. 

This can be found in J. V. Uspensky, Theory of Equa- 
tions, McGraw-Hill, p. 214. The next four lemmas are in 
Harry Pollard, The Theory of Algebraic Numbers, John 
Wiley, p. 53, p. 60, pp. 63-66, p. 72. 


Lemma 10.4. Let a and 8 be algebraic numbers in a 
field K of degree h over the rationals. If the conjugates of 
a for K are a = a, ag, -*-, a, and for B are B = 1, Be, 
+++, Br, then the conjugates of o8 and a + B are aif, -*-, 
anB, and a; + Bi, +++, a + Br. 


Lemma 10.5. Jf @ is an algebraic number, then there is 
a@ positive rational integer r such that ra is an algebraic in- 
teger. 


Lemma 10.6. Jf K is an algebraic number field of degree 
h over the rationals, then there exist integers B1, Be, +++, Br 
in K such that every integer in K is expressible uniquely as 
a linear combination 9:8; +----+ grBn with rational inte- 
gral coefficients. The numbers 8; are called an integral basis 
for K, and the discriminant of such a basis 1s a non-zero 
rational integer. 


Lemma 10.7. If ais an algebraic number in a field K of 
degree h over the rationals, then the norm N(a), defined as 
the product of a and its conjugates, satisfies the relation 
N(aB) = N(a)-N(g). Also N(a) = 0%f and only if a = 0. 
If a is an algebraic integer, then N(a) ts a rational integer. 
If a is rational, then N(a) = a. 
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Finally, from complex variable theory we need the con- 
cept of entire function, i.e., a function that is analytic in 
the whole complex plane, and Cauchy’s residue theorem. 
These ideas can be found, for example, in K. Knopp’s 
Theory of Functions, vol. I, Dover, p. 112ff. and p. 130. 


3. Two lemmas. Lemma 10.8. Consider the m equa- 
tions in n unknowns 


(10.1) 
OnyXy + Axote +:°+++ Agntn = 0, k= 1, 2, +++, m, 


with rational integral coefficients a;;, and withO <m<n. 
Let the positive integer A be an upper bound of the absolute 
values of all coefficients; thus A = |a;;| for all it and j. 


Then there is a non-trivial solution x1, 2, - ++, Xn in rational 
integers of equations (10.1) such that 
las] <1 + (nAye—™, jg=1,2,--+,n. 
Proof. Write y;, for a,,21 +-+---+ Gentn 80 that to each 
point x = (21, %2, --+, Zn) there corresponds a point y = 
(Y1, Y2, -**, Ym). A point such as z is said to be a lattice 


point if its coordinates z; are rational integers. If x is a 
lattice point, then the corresponding point y is also a lat- 
tice point because the a,; are rational integers. Let q be 
any positive integer. Let x range over the (2q + 1)” lat- 
tice points inside or on the n-dimensional cube defined by 
|z;| <q for all subscripts 7, Then the corresponding 
values of y;, satisfy 


DD Oy syhj 
j=l 
Thus, as x ranges over the (2¢ + 1)” lattice points as 


indicated, the corresponding lattice points y have co- 
ordinates y, which are integers among the 2nAq+1 


lye| = S Dola,|-|2;| < > Aq = nAg. 
j=l 


j=l 
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values 0, +1, +2, ---, +nAg. Thus the lattice points 
y have at most (2nAgq + 1)” possible locations. We want 
to establish that at least two of these lattice points y co- 
incide, and this we can do by proving that for some q 
there are more points than there are possible locations: 
ie., that 


(10.2) (2g + 1)” > (Q2nAg + 1)”. 


In order to prove this we now specify the integer q: de- 
fine 2q as the unique even integer in the interval of length 
2 defined by 


(10.3) (nAy™/@-™ — 1 < 2g < (nA)™/“—™ 4 4, 


The first part of this inequality implies that (nA)” Ss 
(2qg + 1)", and so we have 


1 ™ 
(2nAq-+ 1)" = (nA)™ (24 ie —) < (nA)"(2q + 1)" 


S (2g + 1)*-™(2q + 1)" = (29 + 1)”, 


which is (10.2). Thus, as x ranges over the (2q¢+ 1)” 
lattice points defined by |2;| S$ qg, the corresponding lat- 
tice points y are not all distinct. Say that we have iden- 
tical points y corresponding to x = (xj, «++, 2,) and z = 
(ci, *+*,2,). Then x = (2, — 2}, +++, 2, — 2,) gives the 
non-trivial solution of (10.1) stated in the lemma, be- 
cause, by (10.3), 


|a; — x; | S [x] + laf] Sata < (Ave ™ +1. 


Notation. For any element a of an algebraic number 
field K, let || a || denote the maximum of the absolute 
values of a and its conjugates. By Lemma 10.4 we note 


that lo +6] S l}o|] + |] 6] and || 8 | s |] a]]-]] 6 II. 
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Lemma 10.9. Consider the p equations in q unknowns 

(10.4) 

one, + anebe +++++ akega = 0, k = 1,2, -+-, p, 


with coefficients a;; which are integers in an algebraic num- 
ber field K of finite degree. Assume thatO<p<q. Let 
A 2 1 be an upper bound for the absolute values of the co- 
efficients and their conjugates for K, thus A = || a;; || for all 
tandj. Then there exists a positive constant c depending on 
the field K but independent of a;;, p, and q, such that the 
equations (10.4) have a non-trivial solution &, &, -++, &q 
in integers of the field K satisfying 


HI &y | <et+ c(cqA)?!(a—?), k=1,2,-+-,p. 


Proof. Let h be the degree of K over the field of ra- 
tional numbers, and let 81, Be, ---, Bx, be an integral basis 
for the field. If a is any integer of K, then by Lemma 10.6 
we can express a uniquely as a linear combination of the 
integral basis, 


a = 918; + goB2 +-+-+ grBr, 


with rational integral coefficients g;. Denote the conju- 
gates of a for K by a = a, a, ---, a, and similarly 
for the 6;. Taking conjugates in the last equation, by 
Lemma 10.4 we get 


a) = 98) + 9288 BS eda gnB, i= 1,2, +++, h. 


The determinant |g{| is the discriminant of the basis, 
and it is not zero by Lemma 10.6. Hence we can solve 
these equations for the g; as linear combinations of the 
a”, with coefficients dependent only on the basis. Tak- 
ing absolute values throughout these solutions, we can 
write 


(10.5) A <cy I a ||, j=1,2,---,h, 


140 THE GELFOND-SCHNEIDER THEOREM Ch. 10 


where the positive constant c; is dependent on the field K 
but independent of a. (In fact, since the discriminant of 
an integral basis is a non-zero rational integer by Lemma 
10.6, we can specify a value for c,: namely h times the 
maximum of the absolute values of the first minors of the 
determinant |¢{|. However, no use will be made of this 


specific value.) 
To get the integers £; which satisfy the equations (10.4), 
we write them in terms of the integral basis, 


h 
(10.6) & = D) 25;6;, t=1,2,-++,¢. 
j=l 


Thus the problem becomes that of finding suitable ra- 
tional integers z;;. Equations (10.4) can be rewritten as 


(10.7) 
qg g h 
Dd nits = DX DY auhjaij = 0, k=1,2,+-+,p. 
i=1 f=1 jel 


Now the integers a;;8; can, by Lemma 10.6, be expressed 
in terms of the integral basis, say 


h 
(10.8)  axiBy = MrijrBr, 
1 


r= 
k= 1, “tty Dy t= 1, n'y qd, j= 1, ee 
with rational integers m,:j,. Hence (10.7) becomes 
qd h h 
De dy Ly MasirteiBr = 0, k= 1,2, ---,p. 
Now the 8, are linearly independent over the rational 


numbers, and so we set the coefficient of each 6, equal to 
zero, 
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qa h 
(10.9) > » Mijrliz = 0, 
t=1 j=l 
k = 1,2, +--+, p, r=1,2,---,h. 


Here we have ph equations in the gh unknowns 2;;, and 
we will apply Lemma 10.8. In order to do this, we need 
an upper bound on the absolute values of the coefficients 
Mxijr. We apply the result (10.5), with a replaced by 
ai8;, and g; replaced by mxz;jr as in (10.8), whence 


| mrizr| < €1 || cxs8; I] S e1 |] oes || - Il Bs II 


= cyA I B; I = CA, 


wherein A is the bound given in the statement of the pres- 
ent Lemma 10.9, and cz is a positive constant chosen not 
only to satisfy cg 2 ¢, || 8; || for all 7 but also so that c2.A 
is a rational integer. This last requirement makes c, de- 
pendent on A, but only in a trivial way which does not 
affect the order of magnitude of cp: since A 2 1, the real 
number cz can be chosen in the interval 
1 + ¢-max || B; || > ¢2 2 cy-max | 8; || 
] ] 

so that c2A is a rational integer. 

Hence we can apply Lemma 10.8 to equations (10.9) 
with m,n, and A replaced by ph, gh, andc2A. Thus there 
is a non-trivial solution of (10.9) in rational integers 2;; 
satisfying 


ijl < 1 + (qhegA)?* (ah—Ph) =1 + (hegg A)?! (4-?), 
] 


Substituting these estimates in (10.6), we get a solution of 
(10.4) which satisfies the inequalities 


Il él < h-max By {1 + (heag A)?! 9? } 
j 


<et+ c(cqA)?! @—?) , 
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provided that c is chosen to exceed hc and h || 8; || for all 
j. Thus c depends on the field K, but is independent of 
a;;, p, and g. Furthermore, since at least one 2,; is not 
zero, it follows from (10.6) that at least one é; in this solu- 
tion is not zero, because the 8; are linearly independent 
over the rational numbers. 


4, Proof of the Gelfond-Schneider theorem. We 
assume that there are two algebraic numbers a and 8 sat- 
isfying the hypotheses of Theorem 10.1, but, contrary to 
the conclusion of that theorem, we assume that a? is al- 
gebraic. We prove that these assumptions lead to a con- 
tradiction. Writing y for a = exp (8 log a), we define K 
as a finite algebraic extension field of degree say h over 
the rational numbers which contains a, 8, and y. We 
now write a collection of simple definitions and relation- 
ships for reference, and then we will explain them: 


(10.10) m=2h+3, q>4m?, n= q?/2m, 
=g=2mn, n>4g@. 


The first is a definition of m, so that m and h are fixed 
throughout the discussion. The integer ¢ is chosen larger 
than 4m”, and so that g’ is a multiple of 2m, the integer 
n being the quotient of these. Then ¢ is defined as q’, 
and the last relation, n > g, follows from the others. 
These specifications on g, n, ¢ will be supplemented by 
one further requirement later: namely that these integers 
be sufficiently large to exceed certain constants. These 
constants will be denoted by c, c;, C2, ¢3, ---, all of them 
independent of n, q, t. 
Define p1, p2, «++, p¢ aS the numbers 


(10.11) 
(r+kB)loga for r=1,2,-+-,g, k=1,2,-++,¢. 
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There is no need to specify exactly which of these is 1, 
which pe, etc. Define the entire function 


t 
(10.12) FQ) = D ayexp (o)), 
j=l 
where the 7; are algebraic integers in the field K to be 
specified presently. 
We use Lemma 10.5 to get a rational integer c,; > 0 so 
that c,a, c,6, and c,y are algebraic integers. Then con- 
sider the mn equations in the 2mn unknowns 3;, 


(10.13) cht?" 4 (log a)—*F@ (b) = 0, 
a=0,1,:--,n-—1, b= 1,2, -++,m. 


We will apply Lemma 10.9 to these equations, but in 
order to do so we must verify that the hypotheses of that 
lemma are satisfied. The coefficient of 7; in any of the 
equations (10.13) is, by (10.11), 
(10.14) cf*?"4(log a) “05 exp (bp,) 
= oft?" (r + kp)* exp {b(r + kB) log a} 
= cht 2nd (p + kB)tarPy*?, 
This relation establishes that the coefficients of 7; in 
(10.13) are algebraic integers in K. To see this, note that 
the last expression in (10.14) is a polynomial in a, 8, y of 
degree a-+7rb+kb. But the maxima of a, b, r, k are 
n—1, m, q, g, respectively, so thata+7rb+ kb Sn — 
1+ 2mg. Thus the factor c+?” guarantees that (10.14) 
is an algebraic integer. 


To find a bound for the coefficients (10.14) and their 
conjugates, we observe that 


(10.15) [|r +e i] Siir ll] +e il-le ll 
Sq+q|8]} =a{1+ lef}. 
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Define cz as the maximum of || a ||, || v ||, 1 + || 6 ||, and 
we can say that the absolute value of any coefficient 
(10.14) or of any conjugate thereof is bounded by 


er?" (gcy)"cz"™" = (cyea)"{ (exe)? }*(-V 2m )"n”!?, 


by (10.10). Define cz as (¢y¢g)2"t?-W2m, and this bound 
can then be replaced by cjn"/? since gq <n. Note that 
C3, like ci, C2, and m, is independent of n. 

We can now apply Lemma 10.9 to (10.13) with A re- 
placed by cjn”/?, and we conclude that these equations 
have a non-trivial solution 7; such that, for every j, 


Il 13 || < ¢ + e{c(Qmn)ogn”!? mn! Gm —mn) 
= ¢ + 2c?mnegn"!? < 38c?mnczn"!?, 


The constant c is dependent on the field K but not on 
n. Now 27° >n>q>™m; so we can replace mn by 4” 
in the last inequality, and combine all constants to write 


(10.16) Il 23 || < ein”/?, 


where cq is independent of n. This non-trivial solution 
nj in integers of the field K of equations (10.13) is used 
in (10.12), so that F(z) is now completely defined. 


Lemma 10.10. There exist integers p = n and B in the 
range 1 < B<m such that F(b) = 0 fora = 0,1, ---, 
p — landb = 1, 2, ---, mand F®(B) ~ 0. 

Proof. If such an integer p exists, it certainly satisfies 
p =n because of (10.13). It will suffice to prove that 
F (1) does not vanish for all the values a = 0, 1, 2, 3, 
-»+, — 1. Assume that F(1) does vanish for these 
values, so that by (10.12) 


t 
Do 150; exp (0;) = 0, OSasit-1l. 
‘a1 
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But the 7; are not all zero, so we obtain a vanishing de- 
terminant, 


O = det | pf exp (o,)| = det |p?| - [J exp (0;), 0 = det lpsl. 
Jj 


We get the latter equation because exp (p;) #0. By 
Lemma 10.3 the vanishing of this Vandermonde determi- 
nant implies that two of the p’s are equal, say p; = px. 
By (10.11) and the fact that log a ¥ 0 by hypothesis, it 
follows that @ is rational. But this contradicts the hy- 
potheses of Theorem 10.1, and so the lemma is proved. 
Next we use Lemma 10.10 to define the non-zero value 


(10.17) 
t 
g = (log a)? F (B) = D7 nj(log a)~?pf exp (Bp;) 


j=l 


t 
= Liar + kpyrabry™, 
jul 
the latter forms stemming from (10.12) and (10.11); the 
values of r and k in the last. sum depend on J, as in (10.11), 
but the exact relationship will not be needed in what 
follows. 


Lemma 10.11. There exists a positive constant C, inde- 
pendent of n and p, such that 


ING)| 2 C7? 


Proof. Recall that the n; are integers in K, and that 
the rational integer c,; was chosen so that cya, c18, cry are 
integers in K. Hence c?t?”% is an algebraic integer in 
K, by (10.17) and the facts that g is the maximum value 
of both r and & in (10.11), and m is the maximum value 
of B by Lemma 10.10. Now ¢g <n S p so that 


p+2 142: 
egret < fey YP = 6, 
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where cs like c, and m, is independent of p and n. Also 
c§¢ is an algebraic integer, so by Lemma 10.7 we have 


1s [N(E&)| = IN(CE)NGO)| = BINGO), 
IN()| = ()-?. 
This establishes the lemma since h is also independent of 
n and p. 


Lemma 10.12. There exists a positive constant c, inde- 
pendent of n and p, such that || ¢ || < c?p?. 
Proof. By (10.17) we have 


Il ll  e-max {If 5 [ll] 7 8 HP ll @ (PPh y 174), 
a] 
where again the values of r and k are dependent on j, as 
in (10.11), but the precise relation is of no consequence. 
Now g<nsSp, and ¢ = 2mn < 2” for n sufficiently 


large. We can use (10.16), and replace r, k, B, || a|/, || + |l, 
1 + || @|| by their maxima g, g, m, co, c2, cz to get 


II FI] < 2"ehn"!?{qco}Pex™%en™* < {2eqon*?"}?n”/2q?, 
Also by (10.10) we have 
g? = (V/2m)?n?!? < (4/2m)?p?!? and n”/? < pri? 
Applying these to the previous inequality we get 
El] < {2eqegt?”4/2m}?p? = cP p?. 


By Lemma 10.10 the entire function F(z) defined in 
(10.12) has zeros of order at least p at the points z = 1, 
2, +--+, m. Hence S(z) defined as follows is also an entire 
function, 


(10.18) Sz) = p!F(z) Il (zg ~ b)7? II (B — b)?. 
b=1 b=1 
b+B 
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Expanding F(z) in a Taylor series of powers of z — B, 
(2- B)?F)(B) (z — B)? tl pet) (B) 

p! (p+ 0)! - 


substituting this in (10.18), and then setting z = B, we 
get 


(10.19) S(B) = F®(B), § = (log a) *S(B), 


the latter from (10.17). By Cauchy’s residue theorem 
we obtain 


F(z) = 


(10.20) Spyies 
- Qido2 —B 


’ 


for any simple closed curve C around the point z = B. 
We shall take C' to be the circle |z| = p/g. This encloses 
the point z = B because, by (10.10), 
n 
(ei: -See 22 Snee 
q 2q 2q 4m 

We now get an upper bound on ¢ by use of (10.19) and 
(10.20). If uw is any complex number, then |{expu| S 
exp |u|. We use (10.11) to write the following inequality 
for all z on the circle |z| = p/gq, 


lexp (zp;)| S exp ([zp;|) 
p 
S exp Pe + @{Bl)- og al} = 6, 
q 


ce = exp {(1 + |8])-|log a}. 


The constant cg, like all others in this analysis, is inde- 
pendent of n and p. Hence by (10.12) and (10.16) we 
have, for |z| = p/g, 


(10.22) | F(z)| S tegn”/cB < (2cqcg)?n”!? < ofp? ’?, 
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wherein we have used t = 2mn < 2” < 2? for nsufficiently 
large. Next, for any b = 1, 2, ---, m, we use (10.21) to 
obtain 


(10.23) jz ~b| = lz} — |b} 22 —-meZk, 
q 2g 


A 


2 Pp 
lz — b|-? (“) : 
Dp 
We now apply (10.22) and (10.23) to (10.18) and con- 
clude that, on the circle {z| = p/q, 


QWq\™P _m 
IS@)| < plop??? C) Il |B 3 


b=1 
beB 
m Vin"? 
= {c72"(2m)™/? TT |B — b| }?p!p?/? (~*) 
bs} e 


ll 
° 
ou 
= 
3 
Te 
re) 
| 


Now p! < p?, and V n/p < Vp since n S p; so we 
have 


(10.24) |S) |< cBpre-™?, 


for all z on the circle |z! = p/g. Finally we turn to 
(10.19) and (10.20), and conclude that 
8 
f ©) » } 
CZ B 


The length of the path of integration is 2rp/q, and so we 
can apply (10.24) and (10.23) to write 


i 
Is] < |log a|~?-[S(B)| Sa OR CICT 


Tv 
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—p Pp pia—my2 , 24 
[f| < {log a| ee os 


< {2cg|log a|~"}?p?o—™/? 


= chpP3—m)/2, 


With this estimate for |¢|, and that of Lemma 10.12 for 
its conjugates, we write, by (10.10), 


IN (S) | < chp?B—™!2 (cPpP)h-l = (egchl)Py—P = CBp-?, 
where Co = coc’. This and Lemma 10.11 imply that 
chp ? > C?, Ceo > PD, 


for some positive constants independent of n and p. But 
this is a contradiction, because p = n, and we can choose 
n arbitrarily large. 


Notes on Chapter 10 


The special case of Theorem 10.1 for any imaginary quadratic 
irrational 6 was established by A. O. Gelfond, Compt. Rend. Acad. 
Sct. Paris, 189 (1929), 1224-1226. The original papers establish- 
ing Theorem 10.1 are: A. O. Gelfond, Doklady Akad. Nauk S.8.S.R., 
2 (1934), 1-6; Th. Schneider, J. reine angew. Math., 172 (1935), 65- 
69. The American Mathematical Society has provided an English 
translation (Translation Number 65) of an advanced expository pa- 
per by A. O. Gelfond, The approximation of algebraic numbers by 
algebraic numbers and the theory of transcendental numbers, Uspehi 
Mat. Nauk (N.S.), 4, no. 4 (32), 19-49 (1949). There is an exposi- 
tion of Gelfond’s proof by E. Hille, Amer. Math. Monthly, 49 (1942), 
654-661. 

The proof of Theorem 10.1 given here is based on a simplification 
of Gelfond’s proof by C. L. Siegel, Transcendental Numbers, Prince- 
ton, pp. 80-83. 

Although the methods of Chapters 9 and 10 establish the trans- 
cendence of wide classes of numbers, there are many unsolved prob- 
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lems here. For example it is not known whether Euler’s constant 
is irrational, let. alone transcendental. 

Siegel’s monograph cited above is the most complete statement 
of our present knowledge of transcendental numbers, especially 
those numbers that arise from solutions of linear differential equa- 
tions. Finally, we cite some work of a different sort, wherein cer- 
tain classes of continued fractions are shown to be transcendental: 
C. L. Siegel, Abhandl. preuss. Akad. Wiss., Phys.-math. Kl., no. 1 
(1929), 29; G. C. Webber, Bull. Amer. Math. Soc., 50 (1944), 736- 
740. 
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LIST OF NOTATION 


The g.c.d., or greatest common divisor, of the in- 
tegers h and k. The same notation is used for the 
interval from h to k on the real line, where h and k 
are any real numbers. 

The greatest integer less than or equal to 2, i.e., 
the unique integer n satisfying the inequality n S 
za<nt+l. 

The fractional part of z; thus (x) = x — [x]. 
Euler’s ¢ function. 

The finite field having the p elements 0, 1, 2, ---, 
p —1, with addition and multiplication defined 
modulo the prime number p. 

The set of all polynomials in x with coefficients in 
Jp. 

The nth cyclotomic polynomial, i.e., the monic 
polynomial whose zeros are the primitive nth roots 
of unity. 

The field of rational numbers. 

The degree of the field K over the field F, it being 
presumed that K contains F. 

The field obtained by extending F by the adjunc- 
tion of a. F(a) consists of the set of all rational 
functions f(a)/g(a), where f and g are polynomials 
with coefficients in F, and g(a) ¥ 0. 

A finite continued fraction, whose value can be de- 
fined recursively as [x0, 21, +++, In—2, Zn—1 + 1/Zal. 
An infinite continued fraction, with value lim [2, 
T2, °°, Zn]. 

Is contained in. 

Belongs to, is a member of. 
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LIST OF NOTATION 


The union of; A U B is the set of elements z such 
that z is a member of at least one of the sets A, B. 
d is a divisor of n. 

d is not a divisor of n. 

Same as e*, the exponential function. 

The maximum of. 

The maximum of the absolute values of the alge- 
braic number a and its conjugates. 


GLOSSARY 


The purpose of this glossary is to supplement, rather than replace, 
the index. Many of the terms here are not defined elsewhere in 
this monograph. 


algebraic integer. An algebraic number of a special type, namely one 
that satisfies some equation 2” + ayz7~1 +---+a, =0 with 
rational integral coefficients. The minimal polynomial of an al- 
gebraic integer is also monic with integral coefficients. 

algebraic number. Any complex number that satisfies some equation 
of the form x” + ayz"~! +---+ a, = 0 with rational coefficients. 

almost all. A set S contains almost all real numbers in an interval 
I if the real numbers (or points) of J not belonging to S constitute 
a set of measure zero. 

Archimedean property. If a and 8 are any two positive real num- 
bers, then there exists a positive integer n such that na > 6. 

basis of a field. See finite extension field. 

closed interval. See interval. 

convergent. See n-th convergent. 

countable set. A set that can be put into one-to-one correspondence 
with the natural numbers 1, 2, 3, ---. See the notes on Chapter 1 
for a variation of this definition that is sometimes used. 

cyclotomic polynomial. The nth cyclotomic polynomial F(z) is the 
unique monic polynomial whose zeros are the nth primitive roots 
of unity. 

degree of a field. See finite extension field. 

degree of an algebraic number. The same as the degree of the mini- 
mal polynomial of the algebraic number. 

denumerable set. One that can be put into one-to-one correspond- 
ence with the natural numbers 1, 2, 3, ---. 

division algorithm. Given rational integers a and b > 0, one obtains 
integers g and r such that a = bg +r withO Sr <b. 

Eisenstein irreducibility criterion. Let p be a prime and f(z) = 
ax” + ayz"—! +---+ a, a polynomial with integral coefficients 
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such that p } ao, p? ¥ an, pla; fori = 1,2, -+-,. Then f(z) is 
irreducible over the rational numbers. 
elementary symmetric polynomials. The elementary symmetric poly- 
nomials (or functions) of 21, z2, +--+, 2, are the polynomials o1, 2, 
++, on defined by the identity 


i (y — 22) = y™ — ory®! F agy™? +--+ + (1) "on. 


Euler o function. For any positive integer n, ¢(n) is the number of 
positive integers j S n such that the g.c.d. (j, n) = 1. Its value 
is g(n) = n(l ~ pr). — pa) --- (1 — p;), where the distinct 
prime factors of n are p1, Po, °° +, Dr. 

everywhere dense. A set S of real numbers is everywhere dense in 
an interval I if, given any two numbers a and 6 in J, say with 
a < @, there is a number s in S such that a <8 < 8. 

field. A-system S of elements a, b, c, etc., in which a sum a + b and 
a product ab are defined for any pair of elements, with the follow- 
ing properties satisfied by all elements: (1) a + b and ab are in S; 
(2)a+(b+c) = (a +5) +€ and a(bc) = (ab)c; (3) there exist 
elements 0 and 1 in S such that 0 +a =a+0 =aand1l-a= 
a-1 = qa; (4) to any element @ there correspond elements —a and 
a—' (only the former in case a = 0) such that (—a) +a =a+ 
(—a) = 0 and a-a = a-a = 1; (5)a +b =b +a and ab = 
ba; (6) a(b +c) = ab + ac. Examples of fields are: the rational 
numbers; the real numbers; the complex numbers; the finite field 
denoted by Jz consisting of the numbers 0, 1, 2, ---, p — 1 where 
p is a prime number, addition and multiplication being defined 
modulo p. 

finite extension field. A field K is said to be a finite extension of a 
field F if K contains F and if there exists a finite set of elements 
in K, say ky, ke, «--, kn, such that every element in K is expressi- 
ble as a linear combination of these elements, 2a;k;, with coeffi- 
cients a; in F. If no fewer than n elements of K can serve thus, 
then n is the degree of K over F, and the elements ky, ko, -+-, kn 
are said to form a basis for K over F. Stated otherwise, the degree 
of K over F is the maximum number of elements of K that are 
linearly independent over F. 

finite field. A field with a finite number of elements. 

fundamental theorem of arithmetic. Any positive integer except 1 
can be written as a product of prime factors which is unique apart 
from the order of the factors. 
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fundamental theorem on symmetric polynomials. Any symmetric 
polynomial f(z1, t2, +++, Zn) with coefficients in a field F is ex- 
pressible as a polynomial p(o1, 2, --+, on) in the elementary sym- 
metric polynomials with coefficients in F. If the coefficients of f 
are rational integers, so are the coefficients of p. 

Gauss's lemma. If a polynomial f(z) with rational integral coeffi- 
cients can be factored into two polynomials, f(z) = g(x) A(z), with 
rational coefficients, then there is a non-zero rational number r 
such that r g(x) and r— h(x) have rational integral coefficients. 

g.c.d., greatest common divisor. The g.c.d. of two integers a and b 
(not both zero) is the largest positive integer that divides both a 
and 5); it is denoted (a, b). 

half-open interval. See interval. 

homogeneous polynomial. The polynomial f(2x1, 22, -*+, Zn) is homo- 
geneous of degree m if f(dx1, Ate, «++, ATn) = A“ (Zi, Le, +++, Tn). 

interval. If a and 8 are any two real numbers, say with a < @, the 
real numbers or points z satisfying a < z < 8 form an open inter- 
val, a Sz S$ a closed interval, anda <2 SforeSzx<Ba 
half-open interval. 

trrational number. A real number that is not rational. 

irreducible polynomial. One that is not reducible. 

lattice point. In n-dimensional space, any point (11, @2, °+*, Xn) 
having integral coordinates. 

linearly dependent. The values a), a2, «++, ay are linearly depend- 
ent over a field F if there exist elements ci, co, --+, Cn in F, not all 


n 

zero, such that > cio; = 0. Otherwise they are linearly inde- 
pendent. tel 

measure zero. A set of real numbers or points is said to have meas- 
ure zero if it is possible to cover the points of the set with a col- 
lection of intervals of arbitrarily small total length. 

minimal polynomial of an algebraic number. The unique monic poly- 
nomial of least degree with rational coefficients which has the al- 
gebraic number as a zero. 

monic polynomial. One having 1 as the coefficient of the term of 
highest degree. 

n-th convergent. The nth convergent to the continued fraction [z, 
21, Ze, -+-} is the finite continued fraction [zo, 21, +++, tn). 

open interval. See interval. 

partial quotient. See simple continued fraction. 

primitive n-th root of unity. An nth root of unity whose powers give 
all the nth roots of unity. Such a number is expressible as e?"™*/" 
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= cos (2xk/n) + 7 sin (2rk/n) for some integer & relatively prime 
to n. 

rational number. One that can be written as the quotient of two 
integers; thus h/k with k ¥ 0. 

reducible polynomial. A polynomial f(z) with coefficients in a field 
F is reducible over F if it can be factored non-trivially into two 
polynomials g(x) and A(x) with coefficients in F, f(z) = g(x) h(x). 
By “non-trivially” we mean that each of g(z) and h(x) has degree 
at least one. 

simple continued fraction. The continued fraction [z, 21, z2, ---] is 
simple if each partial quotient x; is an integer, positive except per- 
haps when ¢ = 0. 

symmetric polynomial. f(x1, 22, -++, Zn) is symmetric in 2, v2, +--, 
zy if any permutation of the 2; leaves f invariant. 

transcendental number. A complex number that is not algebraic. 

triangle inequality. Any two complex numbers a and 6 satisfy 


la+6| Sle] + [6] and ja —6| 2 fel — [6]. 
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INDEX OF TOPICS 


Absolutely norma] number, 116 
Algebraic integers, 29 
everywhere dense property, 
85 
Algebraic number, 28 
degree of, 28 
order of approximation of, 90, 


Algebraic numbers, closure prop- 
erties, 84 
countability of, 87 
form a field, 84 
Approximation, best possible, 62, 
68, 70, 81, 93 
of an irrational number, 42, 
44, 61, 68, 70 
of several] numbers simultane- 
ously, 45, 47, 48 
order of, 88 
to a complex irrational num- 
ber, 81 


Complex irrational number, 81 
Complex rational number, 83 
Conjugate algebraic numbers, 
122 
Continued fraction, 52 
finite, 52 
infinite, 57 


Continued fraction, periodic, 63 
simple, 52 
Convergent to a continued frac- 
tion, 57 
Countability, 4 
alternative definition, 14 
of algebraic numbers, 87 
of rational numbers, 4 
Cyclotomic polynomial, 33 
coefficients of, 34 
irreducibility of, 36 


Decimal! expansion, 6 

Degree of an algebraic number, 
28 

Dense set, 5 

Denumerable set, 4 


e, irrationality of, 11 
transcendence of, 25 
Euclidean algorithm, 51 
Euler ¢ function, 30 
Euler’s constant, 150 
Everywhere dense set, 5 
Exponential function, irrational 
values of, 23 
linear independence of, 117 
transcendental] values of, 131 
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Finite continued fraction, 52 


Gauss’s lemma, 29 
Gelfond—Schneider theorem, 134 


Hilbert’s seventh problem, 134 
Hurwitz’s theorem, 68, 70 
Hyperbolic functions, irrational 
values of, 22 
transcendental values of, 131 


Incommensurable numbers, 14 
Infinite continued fraction, 57 
Integral basis, 136 
Irrationality, criteria for, 10, 12, 
15, 44, 60, 75 
of e, 11 
of exponential function, 23 
of hyperbolic functions, 22 
of logarithms, 23, 24 
of ~/m, 16 
of x, 19 
of trigonometric functions, 17, 
21, 41 
Trrational number, 1 
uniform distribution of mul- 
tiples of, 72 
Trrational numbers, everywhere 
dense, 5 
measure of, 2 
not countable, 5 
Irreducibility of cyclotomic 
polynomial, 36 


Kronecker’s theorem, 82 


Lattice point, 44 

Lindemann theorem, 117 

Linear independence of exponen- 
tial functions, 117 
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Linearly dependent numbers, si- 
multaneous approximation 
to, 48 
Liouville number, 91 
transcendence of, 92 
Logarithms, irrationality of, 23, 
24 
transcendence of, 131, 135 


Measure, of algebraic numbers, 
87 

of irrational numbers, 2 

of normal numbers, 103 

of rational numbers, 2 

of transcendental numbers, 88 
Minimal] polynomial, 28 
Monic polynomial, 28 


Normal field, 121 
Normal numbers, definition, 95, 
96 
measure of, 103 
uniform distribution property, 
110 
Norm of an algebraic number, 
136 


Order of approximation, 88 
of an algebraic number, 90, 
93 
of a rational number, 89 


Partial quotient, 52 

Periodic continued fraction, 63 

Periodic decimal, 12 

Pi, irrationality of, 19 
transcendence of, 117 

Pigeon-hole principle, 43 

Primitive roots of unity, 30 


Quadratic irrational, 64 
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Rationality, criteria for, 10, 12, 
44, 60, 75 
Rational number, 1 
order of approximation, 89 
Rational numbers, countability 
of, 4 
everywhere dense, 5 
measure of, 2 


Simple continued fraction, 52 
Simply normal number, 94 
Squaring the circle, 132 


Transcendence, of e, 25 
of Liouville numbers, 92 
of logarithms, 131, 135 
of w, 117 
of values of various functions, 
131 
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Transcendental numbers, defini- 
tion, 29 
measure of, 88 
Trigonometric functions, alge- 
braic values of, 29, 37, 39 
irrational values of, 17, 21, 41 
rational values of, 41 
transcendental values of, 131 


Uniform distribution, 71 
of multiples of an irrational 
number, 72 
property of normal numbers, 
110 
Wey] criterion for, 76, 82 
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Weyl criterion for uniform distri- 
bution, 76, 82 
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